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The paper deals with the variational principles for evaluation of the spectral 
radii of transfer and weighted shift operators associated with a dynamical system. 
These variational principles have been the matter of numerous investigations and 
the principal results have been achieved in the situation when the dynamical system 
is either reversible or it is a topological Markov chain. As the main summands these 
principles contain the integrals over invariant measures and the Kolmogorov-Sinai 
entropy. In the article we derive the Variational Principle for an arbitrary dynamical 
^ ■ system. It gives the explicit description of the Legendre dual object to the spectral 

potential. It is shown that in general this principle contains not the Kolmogorov- 
Sinai entropy but a new invariant of entropy type — the t-entropy. 
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1 Introduction 

The article is devoted to the investigation of dynamical and metric invariants associated 
with the spectral radius of transfer and weighted shift operators. 

Let us consider a compact space X, and let a : X — > X be a continuous mapping. 
This mapping generates the dynamical system with discrete time which we will denote 
by(X,a). 

Among the operators whose spectral analysis is of prime importance in the dynamical 
systems theory are 

a) the shift operators, that is the operators of the form 

T a f(x) = f(a(x)), feF(X), (1) 

where F(X) is a certain functional space, 

b) weighted shift operators 

aT a f(x) = a(x)f(a(x)), f e F(X), (2) 

where a is a fixed function (weight), (operators (2) are also called evolution operators), 
and 

c) transfer operators (associated with the adjoint operators to weighted shift operators 
(see Definition 2.1 and Example 7.6)) among which the most popular one is the classical 
Perron- Frobenius operator, that is the operator acting in the space C(X) of continuous 
functions on X and having the form 

Af(x)= ^{y)f(vl (3) 

where ip G C (X) is fixed. This operator is well defined when a is a local homeomorphism. 

Apart from the 'pure' dynamical systems theory these operators have numerous appli- 
cations in mathematical physics and in particular in thermodynamics, stochastic processes 
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and information theory, investigations of zeta functions and Fredholm determinants, op- 
erator algebras theory, where they serve as an inexhaustible source of important exam- 
ples and counterexamples so also as key constructive elements of the crossed product 
type algebras, in the theory of solvability of functional differential equations, wavelet 
analysis etc. We refer to books [45, 43, 46, 3, 13, 33, 10, 11, 17] and recent papers 
[28, 44, 31, 9, 26, 27, 8, 24] and the bibliography therein. 

Spectral properties of weighted shift and transfer operators and especially the formulae 
and methods of calculation of their spectral radii are tightly related to the ergodic and 
entropy theory of dynamical systems and variational principles of thermodynamic and 
informational nature. Let us recall in brief the spectral radius 'life story'. 

If a : X — > X is a continuous invertible mapping and m is an a-invariant measure on 
X whose support coincides with X (that is any open set has a nonzero measure), and 
a G C(X), then in the space F(X) = L P (X, m), 1 < p < oo, as well as in the space C(X) 
the following formulae for the spectral radius of weighted shift operator (2) are valid: 



\nr(aT a ) = max / In \a(x)\ dfx, 
jueMc J x 

In r(aT a ) = max / In \a(x)\ du. 



(4) 
(5) 



Here M a is the set of all a-invariant probability measures on X and EM a is the set of 
a-invariant ergodic measures on X. 

The statements of this type are called in the dynamical systems theory and related 
fields of analysis the variational principles and we will come across a number of them in 
the article. 

The variational principles (4), (5) were stated by Antonevich and for a number of 
concrete situations they have been proved, for example, in [1, 2] where one can also 
find the corresponding description of the set M a . In the general form (for an arbitrary 
homeomorphism a) these principles were established by Lebedev [37] and Kitover [32]. 
The applications of formula (5) to the calculation of the spectral radii of various weighted 
shift operators are given in [12, 3, 13]. 

If a is not invertible the results related to the corresponding formulae for the spectral 
radius can be divided into two classes. The first class contains the results referred to the 
case when the weighted shift operators act in the spaces of C(X) or L°°(X, m) type. In 
this case formulae (4), (5) preserve their form (see, for example, [40]). The second class 
contains the results referred to the spaces L p (X,m), 1 < p < oo. Here the situation 
changes drastically. At this point the deep 'entropy' and 'stochastic' nature of the spec- 
trum of weighted shift and transfer operators springs out. Namely a complete description 
of this phenomenon is the goal of the paper. 

The starting principal results for this second class have been achieved by Latushkin 
and Stepin [34, 35, 36] under a rather special assumption on the nature of the mapping a. 
Namely, in the case when a is a topological Markov chain (in particular a can be an 
expanding /c-sheeted cover of a manifold X) they proved the following formula for the 
spectral radius of operator (2) in L P (X, m), 1 < p < oo: 



In r(aT a ) 



= sup ( / 



In \a(x) \ du H — 
x P 



In p(x) d\i + h{\i) 



(6) 
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where h{p) is the Kolmogorov-Sinai entropy of the measure \i with respect to the map- 
ping a and p is a certain continuous nonnegative function defined by this mapping and 

such that J2 y ea-i(x) P(v) = 1 for an y x e X - 

Formulae (4)-(6) can be considered as analogues to the known variational principle 
that links entropy and free energy in thermodynamics, and in these formulae the loga- 
rithm of the spectral radius plays the role of free energy. In connection with the problems 
considered it is worth mentioning the works by Maslov where similar relations were in- 
vestigated for certain evolution differential equations of the form 

+ Bu = 0. 

at 

For example, in the paper [41], which is devoted to quantization of thermodynamics, the 
coincidence of free energy of thermodynamic system described by this equation and the 
minimal eigenvalue of the operator B is established. Note that this minimal eigenvalue 
is precisely the logarithm of the spectral radius of the operator e~ B defining the evolution 
of the system. 

The function p arising in (6) in fact marks a crucial interrelation between weighted 
shift and transfer operators in the situation under consideration. Namely, it can be verified 
that in this case 

II ( flT «) n 1 1 L P(X,m) = ' ( 7 ) 

where A: C(X) — > C(X) is a transfer (Perron-Frobenius) operator of the form 

Af(x)= \<pf(y) ( g ) 

(cf. (3)). 

Equality (7) implies the equality 

r(aT a ) =r(A) 1/p . (9) 

The explicit calculation of the norm of aT a by means of formula (7) shows, in partic- 
ular, that in LP the norm of the weighted shift operator aT a with noninvertible shift is 
not equal to the maximum of the weight a as for invertible shift but it is equal to the 
maximum of the weight averaged over inverse images and namely the speed of averaging 
(mixing) under the powers of a is reflected in appearance of the entropy type summand 
in the right-hand part of the variational principle (6). 

In connection with formula (6) it is reasonable to recall the variational principle for 
the topological pressure established by Ruelle [47] and Walters [49]: 



P(a,c)= sup I / c(x)dp: + h{n) J , (10) 
neM a \Jx J 

where c G C(X) is a nonnegative function and P(a, c) is the topological pressure defined 
by a and c. (We would like to stress here that in contrast to (6) a: X — > X in (10) is an 
arbitrary continuous mapping). 
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Comparing formulae (6), (9), and (10) we see that in the case when a is a topological 
Markov chain the following relation is valid 

\nr(aT a ) = ^lnr(A) = ^-P{a,lnifj), (11) 

where ip = \a\ p p. 

The equality in the right-hand part of (11), namely, 

lnr(A) = P(a,lnip), (12) 

has been known, probably, since [45]. In fact, the establishment of the relation between 
the spectral radius of weighted shift operators and topological pressure was the essence 
of the Latushkin-Stepin work. This link along with the observed relation between the 
spectral radii of weighted shift and transfer operators serves as a basement for numerous 
applications of these operators and also inspires the investigation of their spectral proper- 
ties in various functional spaces and, in particular, in the spaces of smooth functions and 
vector-functions (see, for example, [23, 19, 22, 29, 18, 30]). We have to stress again that 
all the mentioned sources deal only with the case when a is a topological Markov chain. 

Remark 1.1 1) In general (that is for an arbitrary continuous mapping a, and even 
when a is a local homeomorphism) the equalities \nr(aT a ) = ^P(a, \nip) and lnr(£>) = 
P(a,\nip) (see (11)), (12)) are not true and (6) is not a generalization of the variational 
principle (4). For example, let us consider an invertible mapping a. Then p = 1. Let us 
set a = 1, thus ip = 1 and we have 

In r{T a ) =0, 

while 

P(a,0) = h(a), 

where h(a) is the topological entropy of the dynamical system (X, a), and in general h(a) 
could be equal to any nonnegative number. 

2) A different proof of the Latushkin-Stepin formulae was obtained in [38, 39] by means 
of newly introduced topological invariants that also gave a number of estimates for the 
spectral radius. In addition it was shown in [38, 39] that the variational principle (4) and 
the Latushkin-Stepin result are in a way the 'extreme points' of the situations one could 
come across when dealing with the calculation of the spectral radius r(aT a ). 

3) Note that if p — > oo then formula (6) transforms into (4) and this agrees with 
the fact (that has been already noted) that in the spaces of C(X) and L°°(X) type the 
variational principle (4) preserves its form. 

In the present article we establish the variational principles for the spectral radii of 
transfer and weighted shift operators for an arbitrary dynamical system (X,a). It will 
be shown that in general these variational principles contain not the Kolmogorov-Sinai 
entropy h(p) as in (6) but a new dynamical characteristics which we call t- entropy. In 
the article t-entropy is calculated explicitly and its dynamical and stochastic meanings 
are uncovered. 
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The description of t-entropy needs the introduction of a deep Legendre transform 
'philosophy' into the process of calculation of the spectral radius. It will be shown that 
the variational principles in fact reflect the Legendre duality for the spectral characteristics 
of transfer and weighted shift operators, and t-entropy is precisely the explicit form of 
the Legendre dual object to the logarithm of their spectral radii. In particular, the 
Kolmogorov-Sinai entropy h(n) arising in the right-hand parts of (6) and (10) is nothing 
else than manifestation of the fact that h(n) is the Legendre dual object to the logarithm 
of the spectral radius of weighted shift operator associated with the topological Markov 
chain and it is the Legendre dual object to the topological pressure for an arbitrary a 
respectively. 

We would like to emphasize that the main mathematical basement and ideology of the 
present paper take their roots in the papers [4, 5, 6, 7, 16]. Namely, particular cases of 
t-entropy and the corresponding variational principles have been considered in [4, 5, 6, 7] 
for the situation when the initial transfer operators are the conditional expectation oper- 
ators and the definition of t-entropy was introduced there in a different way, and in [16] 
the variational principle for the spectral radius of weighted shift operators with positive 
weights in L l (X,m) and the corresponding Entropy Statistic Theorem (see Section 9 be- 
low) were proved. In this paper we give a complete 'final' picture of the operator-entropy 
spectral radius phenomena for the operators in question. Here we present a general op- 
erator algebraic approach establishing the variational principles as for the spectral radii 
of arbitrary transfer so also for weighted shift type operators. The approach developed 
exploits only the algebraic structure of the objects considered and does not lean on the 
spaces of action of operators (thus essentially extending the sphere of applications which 
therefore do not have to be linked with functional operators at all). The results obtained 
also unify all the previous structures and we discuss the interrelation between earlier and 
presented here definitions in Section 12. 

We note also that by developing the idea of the mentioned papers Antonevich and 
Zajkowski [14] proved the convexity of the logarithm of the spectral radii and Legendre 
duality for the sums of weighted shift and transfer operators, and in a number of special 
cases evaluated explicitly the corresponding 'entropies'. 

The article is organized as follows. In the starting Sections 2 and 3 we introduce 
the spectral potential (i. e. the logarithm of the spectral radius of a family of transfer 
operators), examine its elementary properties, and in particular establish its convexity 
with respect to weights, and recall the principal properties of the Legendre transform. 
On this base in Section 4 we introduce the dual entropy as the Legendre dual object 
to the spectral potential and derive the 'Legendre' form of the variational principle we 
are going to investigate. Our main goal is to obtain the variational principle and the 
description of the dual entropy in the explicit form not leaning on the Legendre transform. 
This program is implemented in the further part of the paper. In Section 5 we derive a 
lower estimate for the spectral potential which in a natural way leads to the main object 
of the paper — t-entropy (an explicit dynamical expression for the dual entropy). The 
lower estimate obtained in this section is in fact a sharp estimate and this is the essence 
of the Variational Principle which is formulated in the model situation in Section 5 as 
well. In the next Section 6 we reformulate the Variational Principle for the general 
C*-algebraic setting, which in particular provides us with an essential extension of the 
range of applications, and in Section 7 we present a series of types of transfer operators 
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and C*-dynamical systems naturally arising within the frames of the C*-algebraic picture 
chosen. The proof of the Variational Principle needs the development of an adequate 
mathematical basement and we provide this in Sections 8 and 9 where the principal 
properties of t-entropy are examined and the Entropy Statistic Theorem is proved. The 
latter theorem gives statistical estimates of distribution of empirical measures by means of 
t-entropy and therefore plays for t-entropy the role similar to that the Shannon-McMillan- 
Breiman plays for the Kolmogorov-Sinai entropy. In addition it serves as the key technical 
instrument in the proof of the Variational Principle itself which is given in Section 10. 
Further in Section 11 we strengthen the Variational Principle up to the case of not only 
positive but also nonnegative coefficients of transfer operators. The Variational Principle 
derived tells us in particular that t-entropy plays the principal role in the spectral analysis 
of transfer operators and in Section 12 we describe a number of additional properties of 
t-entropy that have not been exploited in the proof of the Variational Principle. Along 
with this we also present a certain alternative definition of t-entropy and discuss the 
interrelation between the notion of t-entropy introduced in this article and that exploited 
previously in [4, 5, 6, 7]. Finally, in Section 13 we examine the interrelation between 
transfer operators and weighted shift operators acting in L p -type spaces and prove the 
corresponding Variational Principle for the spectral radius of weighted shift operators. 

2 Spectral potential of a transfer operator. 
Starting properties 

Let us consider a Hausdorff compact space X. We denote by C(X) the algebra of con- 
tinuous real- valued functions on X equipped with the uniform norm. Let a : X — ■> X be 
a continuous mapping. This mapping generates the dynamical system with discrete time 
which we will denote by (X, a). 

Definition 2.1 A linear operator A: C(X) — > C(X) is called a transfer operator for the 
dynamical system (X, a) if 

a) A is positive (that is it maps nonnegative functions to nonnegative) and 

b) it satisfies the homological identity 



A typical example of a transfer operator is given by the classical Perron-Frobenius 
operator, that is an operator of the form 



where ip G C(X) is a certain fixed function. This operator is well defined when a is a 
local homeomorphism. Clearly it is positive and satisfies the homological identity. 

Further examples and detailed discussion of transfer operators is presented in Section 7. 

Given a transfer operator A we define a family of operators A v : C[X) — > C{X) 
depending on the functional parameter ip G C(X) by means of the formula 



A(foa-g)=fAg, f,geC(X) 




(13) 



Ayf = A(e"f). 
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Evidently, all the operators of this family are transfer operators as well. Let us denote 
by X(<p) the logarithm of the spectral radius of A v , that is 

\(<p) = lim -lnlU^II . 

n— >oo 77, v 

The posit ivity of transfer operator implies that 

X(ip) = lim -ln|U"l|| , (14) 

n^oo 77, f II 

where 1 is the unit function on X, and ||/|| denotes the uniform norm of the function 
/ G C(X). The functional X(ip) is called the spectral potential or the spectral exponent 
of the transfer operator A (depending on weather we have in mind dynamical or spectral 
associations). In this paper when dealing with the objects associated with X(ip) we are 
staying on the platform of dynamical (entropy, thermodynamics, information, stochastics) 
point of view and therefore throughout the paper X(ip) will be called the spectral potential. 
Our goal is investigation of A (if). 

The next starting proposition gives the list of its principal elementary properties. 

Proposition 2.2 The spectral potential X(ip) is either identically equal to — oo on the 
whole of C(X) or takes only finite values on C(X) and possesses the following properties: 

a) (monotonicity) if ip <ip, then X(ip) < X{jp); 

b) (additive homogeneity) X(<p + t) — X(ip) + t for any t G K; 

c) (strong a-invariance) X(ip + ip) — A(y? + ipoa) for all ip,tp G C(X); 

d) (convexity) X(tip + (1 - t)tp) < tX(ip) + (1 - t)A(^) for all ip,ip G C(X) and 
te[0,l]; 

e) (Lipschitz property) X((p) — < \\ip — for all ip, ip G C(X). 
In particular, the spectral potential is continuous. 

Proof, a) The monotonicity of X((p) follows from the positivity of A. 

b) Let us introduce the notation 

S n ip := ip + ipoa + ■ ■ ■ + ipoa a ~ l , ip G C(X). 

Applying n times the homological identity to the operator A™ = (Ae^)" 1 we obtain 

A^f = A(e*A(e* ■ ■ ■ A(e*f )...)) = A n {e s ^f). (15) 

By substituting this equality into the definition of the spectral potential (14) we deduce 
the additive homogeneity of X(ip). 

c) Note that 

S n {^ + 4)oa) = S n (ip + 4>) +i)oa n -i). 

Therefore 

e~\A^) n < (A vH>oa ) n < e\A^) n , where t = 2 ||^|| . 

Substituting these inequalities into (14) we obtain the strong a-invariance of X(ip). 

d) Let us fix a point i6l and consider the linear functional fi(f) '■= A n f(x) on C(X). 
This functional is positive and therefore by the Riesz theorem it can be identified with a 
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certain Borel measure on X. By applying the Holder inequality (with p = \ and q = j^) 
to this measure we obtain 

Ai (e s »(^+( 1 -*W) < (/i(e 5 ^))*(/i(e^)) 1 "*, t G (0, 1). 

This inequality implies in turn that the functional ln/z(e 5nip ) = lnL4"l](x) is convex with 
respect to ip G C(X). Therefore the spectral potential 

\(ip) = lim -ln|U"l|| = lim - sup ln[A"l](x) 

is convex with respect to ip as well. 

e) The monotonicity and additive homogeneity of the spectral potential imply its 
Lipschitz property. Indeed, 

h<p) - m < w + \w - n) - a w = \w-n- 

Finally, the Lipschitz property implies that \(<p>) is either identically equal to — oo on 
the whole of C(X) or takes only finite values on C(X). □ 

3 Convex functionals and the Legendre transform 

Proposition 2.2 shows in particular that the spectral potential is a convex functional 
on C(X). As is known among the standard instruments of investigation of convex func- 
tionals is the Legendre transform. In this section we recall the principal notions and facts 
related to this transform (in essence they are borrowed from [25]). 

Let / be a functional on a real Banach space L with the values in the extended real 
straight line R = [— oo, +00] . The set D(f) — {ip G L | f(ip) < +00} is called the effective 
domain of the functional /. The functional / is called convex, if for all (p,ip E D(f) and 
t G [0, 1] the following inequality holds 

f(t<p + (l-t)rl>)<tf(<p) + (l-t)f(il>). 

The functional / is called lower semicontinuous if the set {ip G L \ f(ip) > c} is 
open for any c G K. One can speak about lower semicontinuity with respect to the norm 
topology or with respect to the weak topology on L, but for convex functionals these 
properties are equivalent. 

Let L* be the dual space to L. The functional /* : L* — > R that is defined on the dual 
space by the equality 

f*{fi) = sup (fi{<p) - f{<p)) = sup (n((p) -f((p)), fieL*, 

<peL <peD(f) 

is called the Legendre dual to the functional / (or the Legendre transform of /). For a 
functional g on the dual space the Legendre transform is defined as the functional on the 
initial space given by the similar formula: 

g*{ip) = sup (n(ip) -g((i)), <peL. 
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Proposition 3.1 Let a functional f : L — > (—00, +00] be not identically equal to +00. 



a) the dual functional f* is convex and lower semicontinuous with respect to *-weak 
topology on the dual space; 

b) if the functional f is convex and lower semicontinuous then f = (/*)* (the Legendre 
transform is involutory); 

c) in general (/*)* is the maximal convex lower semicontinuous functional that does 
not exceed f (the convex hull of f). 

The analogous statements are valid for functionals g: L* — > (—00, +00]. 

We have already proved that the spectral potential X(tp) is convex and continuous 
(see Proposition 2.2). Therefore it can be represented as the Legendre transform of its 
Legendre dual on the dual space C*(X). However we will slightly modify the form of the 
record of this duality. The matter is that in thermodynamics, information theory and 
ergodic theory there is a tradition to change the sign of the dual to X(ip) and the result 
obtained is called the entropy. Following this tradition and, in particular, the results 
of [9], we define the dual entropy S(/i) of the spectral potential X(ip) by means of the 
formula 



Since the dual entropy differs from the dual to X(ip) functional only by sign it follows 
that S(fi) is concave and upper semicontinuous (with respect to *-weak topology). As the 
Legendre transform is involutory the next equality holds true 



This equality is in fact the simplest form of the subject of the article — the Variational 
Principle for X((p). The foregoing discussion implies the existence of the dual entropy and 
its explicit construction by means of formula (16) is possible provided that the spectral 
potential is known. However, the spectral potential itself is a rather hard object to 
investigate. Our principal goal is the independent derivation of the explicit formula for 
the dual entropy, not leaning on the spectral potential. This formula allows, in particular, 
to impart a more effective character to the Variational Principle. The principal result for 
a model example will be formulated in Section 5, its general C*-formulation is given in 
Section 6 and its complete proof will be given in Section 10. 

4 Dynamical potentials and dual entropy 

Certain useful information on the dual entropy S(fi) can be extracted by means of the 
Legendre duality from the properties of the spectral potential already proved. In essence 
this job was implemented in [15]. For completeness of description we present a part of 
the material of this article in this section. 

Let, as above, X be a Hausdorff compact space and a : X — > X be a continuous 
mapping. We denote by M C C*(X) the set of all linear positive normalized functionals 
on C(X) (that is linear functionals that take nonnegative values on nonnegative functions 
and are equal to 1 on the unit function). By the Riesz theorem these functionals are 



Then 




(16) 



\((p)= sup (fi(<p) + S(n)). 



(17) 
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bijectively identified with regular probability Borel measures on X, and the elements 
of C*(X) are identified with regular real- valued Borel measures on X. Therefore with a 
slight abuse of language all the elements of C*(X) will be referred to as measures. 

A measure /i G M is called a-invariant if fi(foa) = //(/) for all functions / G C(X). 
This is equivalent to the identity fi(a~ 1 (G)) = fi(G) for all Borel subsets G C X. The 
collection of all a-invariant measures from M will be denoted by M a . 

Recall that according to [15] a dynamical potential of a dynamical system (X, a) is an 
arbitrary real- valued functional X(ip) on C(X), possessing the following four properties 

a) monotonicity (if ip < ip, then X(ip) < X(ip)); 

b) additive homogeneity (X((p + t) — X(ip) + t for any t G R); 

c) strong a-invariance (X(ip + ip) = X(<p> + ipoa) for all <p,ip & C(X)); 

d) convexity with respect to ip. 

Proposition 2.2 means precisely that the spectral potential of a transfer operator is a 
particular case (and a typical example) of a dynamical potential. 

Let X(ip) be any dynamical potential on C(X). We define the dual entropy S(fj) for this 
dynamical potential just as for the spectral potential by means of the same formula (16). 
Clearly formula (17) holds true as well. 

An equilibrium measure, corresponding to a function <p G C(X), is an arbitrary sub- 
gradient of the functional X(ip) at the point <p (in other words it is a linear functional 
//: C(X) -> R such that \(<p + ip)- X((p) > fi(if>) for all ip G C{X)). Evidently the set of 
all equilibrium measures corresponding to a certain function ip is convex and closed (with 
respect to the *-weak topology). This set is nonempty by the convex analysis theorem on 
the existence of a supporting hyperplane. It consists of a unique measure /i if and only if 
there exists the Gateaux derivative X'(ip). In this case /x = X'(ip). 

The definitions of the dual entropy and equilibrium measure imply 

Proposition 4.1 For any function ip G C(X) and any measure ji G C*(X) the Young 
inequality holds true 

S(ii)<\{<p)-K<P)- (18) 
This inequality turns into equality iff \i is an equilibrium measure corresponding to ip. 

Proposition 4.2 The effective domain of S is contained in M a , that is if S(/i) > — oo ; 
then n is a probability and a-invariant measure. In particular this is true for all equilib- 
rium measures. 

Proof is implemented by contradiction. By the additive homogeneity of the dynamical 
potential for all t G R we have 

S(jM)<\(t)-IM(t)=\(0)+t(l-IM(l)). 

Therefore, if /x(l) ^ 1 then S(/j) = — oo. 

Suppose that /i(ip) < for some nonnegative function ip G C(X). By the monotonicity 
of the dynamical potential for all t > the following inequalities hold 

S(fi) < X(-tip) - /i(-tip) < A(0) + tfji(<p), 

and again S(n) = — oo. 
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The invariance of a measure \i is equivalent to the identity /i(<p) = fi(ipoa), tp G C(X). 
Suppose that /i(y?) ^ fi(ipoa) for a certain function <p. By the strong invariance of the 
dynamical potential we have 

\(tip-t(poa) = A(0). 

Therefore, 

S(n) < X(tip — tipoa) — fi(tip — tipoa) = A(0) — t(fi((p) — /i((/}o«)) 

and thus S(fi) = — oo. 

Finally, if \i is an equilibrium measure corresponding to a function <p e C(X) then the 
foregoing proposition implies S(fx) = X(ip) — My?) > — oo. □ 

The proposition just proved shows that it suffice to define the dual functional (dual 
entropy) only on invariant probability measures. This will be done in the next section. 

Proposition 4.3 // S(fi) > —oo, then /j, belongs to the closure of the set of equilibrium 
measures (with respect to the norm of the space C*(X)). 

Proof. A particular variant of the Bishop-P helps theorem [21] states that if A(</?) is a 
continuous convex functional on a Banach space L, \i e L*, and the difference A — /x is 
bounded from below then the distance from // to the set of subgradients of A is zero. □ 

Proposition 4.2 implies that the supremum in (17) is attained on the set of invariant 
probability measures M a . Therefore, every dynamical potential has the form 

\(<p)= sup (//(</?) +S(//)), (19) 

where S(/i) is the corresponding dual entropy. 

Proposition 4.1 implies in turn that supremum in (19) is in fact maximum that is 

\(<p) = max( f i(<p)+S( f i)), (20) 

and this maximum is attained precisely on equilibrium measures, corresponding to the 
function ip. 

The forgoing observation implies in addition that the uniqueness of an extremal mea- 
sure in (20) is equivalent to the existence of the Gateaux derivative A'(</?). 

Let us present one more statement describing the structure of dynamical potentials. 

Proposition 4.4 If S(fx) is an arbitrary function which is bounded from above on M a , 
then formula (19) defines a dynamical potential. 

Proof of this fact reduces to a trivial verification of all the conditions of the definition of 
the dynamical potential. □ 

Remark 4.5 Since the dual entropy satisfies the inequality S(fi) < A(0), it follows that 
formula (19) implies the estimate 

X((p) < sup fi((p) + A(0). 
neM a 



12 



On the other hand, if we take S(fx) = A(0), then (19) defines the dynamical potential 

A (v?) := sup /x((p) + A(0). 
^eM a 

The previous estimate shows that this dynamical potential is maximal among all the 
dynamical potentials with the fixed value A(0). 

5 Lower estimate for the spectral potential, 
t-entropy, and formulation of the Variational 
Principle in the model setting 

In the previous section we defined the dual entropy of an arbitrary dynamical potential 
by means of the Legendre transform. Let us consider now a particular case when the dual 
entropy is generated by the spectral potential of a transfer operator. Our goal is to derive 
a direct definition for this dual entropy not leaning on the Legendre duality but only on 
the properties of the initial dynamical system and the transfer operator chosen. As it was 
already noted Proposition 4.2 implies that it suffice to define the dual entropy only on 
invariant probability measures. 

Let A: C(X) — > C(X) be a transfer operator with the spectral potential X((p), asso- 
ciated with a dynamical system (X,a), where a is a continuous mapping of a compact 
space X. 

Lower estimate. Preliminaries. We start with a certain heuristic reasoning leading 
us to the definition we are looking for. Let us show how for any invariant probability 
measure fx one can find a number C(fx) satisfying inequality (18), that is 

C{fx)< \(<p)-»(<p). 

Recall the notation 

S n ip := ip + ipoa H h¥»oo! n ~ 1 . (21) 

Consider the expression 

ln|U"l|| = max In \A n (e Sn{(p) )] (x). 

II <P II xgI L ^ 

Given an invariant probability measure fx we have to find a number C(fx) such that 
for every <p the next lower estimate holds 

-ln||^l|| > / <pdfx + C(fx). 
n Jx 

By the invariance of fx we have 

/ S n (pdfx — n / (pd/x. (22) 
Jx Jx 

Let us replace the integral in the left-hand part of (22) by an integral sum (with an 
accuracy up to e) . To implement this we take a (sufficiently fine) partition of X 

k 

D = {D 1 ,...,D k }, X = ]jD t , 

i=l 
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where each A C X is a Borel set such that the oscillation of the function S n ip on A does 
not exceed e. Set 

S n ip(Di) : = sup{ S n ip(x) \ x e Di}. 



Then 



^2 S n ip(Di)n(Di) - e < I S n ip dfi<^2 £"«</?( A)M A 
i=i J x i=l 



)■ 



Let us estimate the value of ln[A"l](x), where i 6 I is arbitrary. Denote by gi the 
index function of the set D^. Then the following inequalities hold true 



ln[A r ;i](x) =lnJ2[A n {e s ^g t )](x) 



> 



K^](x) = ln 



i=l 



E^ (Di) [^](*o) 



i=l 



£. 



To estimate the logarithm first we estimate the sum obtained 



J2e SMDl) [A n gi ](x) > EMA)e S " v(A 



i=l 



[A n g t }(x) 

MA) ' 



where we assume that if MA) = for a certain D iy then we set the corresponding 
summand in the right-hand part to be zero (regardless the value of [A n gi](x)). Now we 
exploit the concavity of the logarithm function and (22): 

e s ^( D i)[A n gi ](x) 



ln[^l](*)> E MA) In j^- 



= Yl MA)MA)+ E MA)ln^i^-: 



/i(£»i)^0 



/i(£»i)^0 



MA 



— n J pdfi+ MA) m 



[^ w gi](s) 

MA) 



£. 



This implies the inequality 
1 



n 



■HKn > / v 



rf/X+-SUp > MA In TTTT 

n „^ n MA) 



(23) 



The essential point here is that the second summand in the right-hand part of the 
inequality does not depend on <p. Since we are seeking the limit inequality as n — > oo we 
naturally arrive at the introduction of the new value: 



C(M:=mfmf-sup V MA) In I^gM , 
vw „ eN d n x6 £ ^„ ^ MA) 



(24) 
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Now tending n in (23) to infinity and using (24) and the arbitrariness of e > we obtain 
the estimate of the form required 

A(y?) > / cpdfi + C(fi). (25) 
Jx 

Let us make two observations regarding the reasoning presented. 

1) The index functions gi exploited in the calculation are discontinuous. As a rule, in 
examples the action of the operator A is naturally defined on these functions as well, but 
in general the expression A n gi is not defined an therefore the reasoning presented needs a 
modification. Because of this in what follows we will consider certain partitions of unity 
in the algebra C(X) instead of partitions of the space X. 

2) Expression (24) for the constant that satisfies estimate (25) is not the best one. One 
can improve it by means of the following consideration. The functional 5 x (f) = f(x) can 
be identified with the probability measure concentrated at the point x and so estimate 
(23) can be considered as an estimate obtained by means of the measures S x . It will be 
shown below that one can replace [A n gi](x) = S x (A n gi) in (23) by m(A n g, i ), where m is 
an arbitrary probability measure on X. Therefore the supremum over x G X in (23) can 
be replaced by the analogous supremum over all probability measures on X, which brings 
us to a definition of a certain new value t(/i) greater than C(/i). The principal result of 
this article is the proof of the fact that the value of the constant r(/i) obtained in this 
manner is the maximal possible one, which in turn implies the Variational Principle. 

From the geometrical point of view the situation here is the following. The set M 
of all probability measures on X is convex and the measures 5 X are the extreme pints of 
this set. The expression obtained in the corresponding estimate depends concavely on 
measure m G M. Therefore the supremum over the set M in the general case is greater 
than the supremum over the extreme points of this set. 

Now we pass to the strict reasoning. 

By a partition of unity in the algebra C(X) we mean any finite set D = {g ± , . . . , g k } 
consisting of nonnegative functions gi G C(X) satisfying the identity g 1 + • ■ • + g k = 1. 

Our first step is the next technical lemma which gives a key to derivation of a lower 
estimate for the spectral potential. 

Lemma 5.1 For any n G N and \x G M a the following estimate holds true 




(if /j,(g) = then we put the corresponding summand in (26) to be zero). 

Proof. Take any positive integer n and a number e > 0. For these numbers let us choose 
a (fine) partition of unity D in the algebra C(X) such that the oscillation of the function 
S n ip = ip + ipoa + ■ — h ipoa n _1 on the support of each function g G D does not exceed e. 
Set 

SnVi.9) ■= sup{ S n <p(x) | g(x) ^ }. 
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Equality (15) and concavity of the logarithm function imply the following inequalities for 
all probability measures \i G M a and m G M: 

£ + mm(A"l) = E + \n^m(A n (e Sn *g)) > lnj^ e Sn ^ a) m(A n g) 

geD geD 

e s ^m(A n g) ^ / Ni e s "^m(A n g) 



E M^^)) + E^)ln^f^> E M^)+ E M^ln^ } 

M^)+ E Ms)m^^. 



Passing in these inequalities to the supremum over m G M we obtain the inequality 

m(A n g) 



e + ln||^l|| >n{S n ip) + sup E Mff) ln 



This inequality along with a-invariance of /i and the arbitrariness of e > implies 
ln||^l|| > W (<p) + inf sup E M<?)ln^^. □ 



Corollary 5.2 For any /i G M Q we nave 

Afa) > /x(^) + inf - inf sup E M») ^ (27) 

Proof. Divide (26) by n and tend n to infinity. □ 

The lower estimate for the spectral potential obtained in Corollary 5.2 makes it nat- 
ural the introduction of the following object (t-entropy), which in essence describes the 
expression in the right-hand part of (27). 

Definition 5.3 T-entropy is the functional r on M a such that its value at pi G M a is 
defined by the following formulae 

t(ji) := .inf ^ , tM := inf r„(/i, D), (28) 

m(A n g) 



r„(/i, D) := sup E M<?) ln -"TY^ ■ ( 29 ) 



The infimum in (28) is taken over all the partitions of unity D in the algebra C(X). If 
we have fi(g) = for a certain function g G D, then we set the corresponding summand 
in (29) to be zero independently of the value m(A n g). And if there exists a function g G D 
such that A n g = and simultaneously /i(g) > 0, then we set r(/i) = — oo. 
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Remark 5.4 As we know the Kolmogorov-Sinai definition of entropy h(fi) is based on the 
notion of entropy of a measure fi with respect to a partition of space and it characterizes 
the behavior of the entropy of partition under its refinement by means of the mapping a. 
Recall that given a partition of the space X by measurable subsets 

X = ]jD t 

i 

its entropy is defined by the expression 

-^/i(A)hiMA). (30) 

i 

Therefore it seems reasonable to look for lower estimates for the spectral potential that 
contain expressions similar to (30). 

Similarly to (30), given a partition of unity D one can consider the sum 

i 

as the entropy of the measure fx assigned to this partition, and the sum 

-$^M#i) lnm (#i) 

i 

can be considered as the corresponding relative entropy of the measure m. 
Then the expression 

i ^ 9i > i i 

can be interpreted as the entropy distance (difference) from m to fx. 
Thus the expression 

mf (-y:Min^4) 

may be treated as the entropy distance from measure fx to the image of the adjoint oper- 
ator A* n or the 'shortage of the entropy'. Here the images L n = A* n (M) form a decreasing 
chain, thus the distance increases and r(/i) characterizes the speed of the increase of the 
distance. 

From this point of view the case when a is reversible is fairly vivid: L n = M and the 
distance is zero. 

In the notation of Definition 5.3 inequality (27) takes the form A(y?) > fx((p) + r(fi) 
and therefore Corollary 5.2 can be rewritten as 

Theorem 5.5 [The lower estimate of the spectral potential] 

A(y?) > sup (fx(cp) + r(fx)). (31) 
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Now we are ready to formulate the principal result of the article. Its essence is that 
inequality (31) is in fact equality, and the restriction of t-entropy t(/x) onto the set of 
invariant measures M a coincides with the dual entropy S(fi). Namely, the following 
theorem holds true. 

Theorem 5.6 [Variational Principle in the model setting] Let A: C(X) — > C(X) 
be a transfer operator for a continuous mapping a : X — > X of a Hausdorff compact 
space X . Then its spectral potential X(ip) satisfies the variational principle 

A( V 9) = max(/i( V 9) + r( /U )), <peC(X), (32) 

and t-entropy satisfies the equality 

r(/i)= inf (\(<p)-n(<p)), fieM a . (33) 

Remark 5.7 1) In fact the equalities established in this theorem are much deeper than 
simply the explicit calculation of the Legendre dual objects arising in the procedure of 
spectral radius evaluation. Much more important (from our point of view) is the obser- 
vation that formula (32) links the spectral characteristics of the transfer operator (the 
left-hand part) with the stochastic characteristics (t(/i) in the right-hand part) of the dy- 
namical system. This ideology will be developed further in Section 9 where in particular 
the interrelation between r(/x) and the distribution of empirical measures is described. 
Namely this interrelation will serve as a key instrument in the proof of the Variational 
Principle. 

2) Formula (32) reveals the partition of the process of calculation of the spectral radius 
into the static component (the first summand in the right-hand part depends only on the 
weight ip) and the dynamical component (the second summand depends only on the shift 
a and the transfer operator A). 

3) The duality established in Theorem 5.6 and the thermodynamic formalism de- 
veloped in [9] leads naturally to introduction of the thermodynamic 'ideology' into the 
spectral analysis of transfer operators. Having in mind this motivation it is reasonable to 
call the functionals (measures) \i at which the maximum in the right-hand part of (32) 
is attained the equilibrium states. We recall in this connection that in accordance with 
a common physical point of view the equilibrium states are the states at which the sys- 
tem 'exists in reality'. From this point of view the duality principle adds dialectics to 
the spectral analysis of transfer operators: since r(/x) describes the measure of the 'most 
typical' trajectories (see, in particular, Section 9) and the value fi(ip) calculates the 'living 
conditions' (recall the corresponding discussion in [7]) then the duality principle tells us 
that the process realizes at a state having the best combination of these components. 

Theorem 5.6 can be extended up to the case of not only positive but also nonnegative 
coefficients of transfer operators. Namely, let us consider ip := e v and denote by £(ip) the 
logarithm of the spectral radius of the operator Aip = A(ip ■ ). We have by definition that 
£(ip) = A (In ■?/>), and formula (32) takes the form 

t(ib)= max (/ ln^ du + rtu)). (34) 
neM a (C) \J X J 

Theorem 5.6 tells that this formula holds true for all strictly positive functions ip G C(X). 
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Theorem 5.8 Under the conditions of Theorem 5.6 formula (34) holds true for all non- 
negative functions ip G C(X). 

Clearly equality (34) is stronger than (32). Nevertheless, it seems worthwhile to con- 
sider formula (32) as the main form of the Variational Principle since it is more natural 
from the point of view of the Legendre duality. 

6 Variational Principle: C*-algebraic picture 

We have formulated the Variational Principle for the spectral potential of a transfer 
operator in the model setting when the phase space X is compact, the mapping a is 
continuous and the transfer operator acts in C(X). In fact the most natural operator 
picture can be written by means of C*-algebraic language. In this section we present this 
picture. The C*-algebraic language, apart from its universality, will provide us with an 
essential extension of the range of applications. 

6.1 We will call C a base algebra if it is a selfadjoint part of a certain commutative C*-al- 
gebra with an identity 1. This means that there exists a commutative C*-algebra B with 
an identity 1 such that 

C = {beB\b* = b}. 

As is known the Gelfand transform establishes an isomorphism between C and the 
algebra C(X) of continuous real- valued functions on a Hausdorff compact space X, which 
is the maximal ideal space of the algebra C. Throughout the article we identify C with 
C(X) mentioned above. 

The next known result (see, for example, [48]) establishes a correspondence between 
endomorphisms of base algebras and dynamical systems. 

Theorem 6.2 If 5 : C — > C is an endomorphism of a base algebra C then there exists 
an open-closed subset Y C X and a continuous mapping a: Y — > X (both Y and a are 
uniquely defined) such that 

[Sf] (x) = X Y(x)f(a(x)), feC, xeX, 

where xy is the index function of Y. In particular if 8(1) = 1 then Y = X and 

[5f](x) = f(a(x)). (35) 

Remark 6.3 It is clear that any endomorphism of a C*-algebra B is completely defined 
by its restriction onto the selfadjoint part C of B and on the other hand any endomorphism 
of C extends uniquely up to an endomorphism of B. Therefore the correspondence between 
endomorphisms and dynamical systems presented in the theorem can be equally described 
in terms of endomorphisms of B. 

6.4 In what follows the pair (C,5), where C is a base algebra and 5 is its certain endo- 
morphism such that 5(1) = 1, will be called a C*-dynamical system, and the pair (X,a) 
described in Theorem 6.2 will be called the dynamical system corresponding to (C, S). The 
algebra C will be also called the base algebra of the dynamical system (X, a). 
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Throughout the paper notation C, 5, X, a will denote the objects introduced above 
and we will use either of them (say S or a) for convenience reasons (once a is chosen then 
5 is defined uniquely by (35) and vice versa). 

Definition 2.1 of transfer operator can be naturally rewritten in terms of C*-dynamical 
systems. 

Definition 6.5 Let (C,5) be a C*-dynamical system. A linear operator A : C — > C will 
be called a transfer operator, if it possesses the following two properties 

a) A is positive (it maps nonnegative elements of C into nonnegative ones); 

b) it satisfies the homological identity 



If in addition this operator maps 1 into 1 we will call it a conditional expectation operator. 

Remark 6.6 Any transfer operator A: C — > C can be naturally extended up to a transfer 
operator on B = C + iC by means of the formula 



On the other hand given any transfer operator on B, its restriction to C (which is well 
defined in view of property a) of Definition 6.5) is also a transfer operator. Therefore 
transfer operators can be equivalently introduced as by means of C*-algebra B so also by 
means of its selfadjoint part — the base algebra C. We prefer to exploit the base algebra 
since in what follows we use the Legendre transform which is an essentially real-valued 
object. 

Remark 6.7 1) In view of (36) the conditional expectation operator satisfies the equality 



Thus this operator is a positive left inverse to the mapping / i— > 5f. 

2) If A: C — > C is a transfer operator and ^-eC then -^A is a conditional expectation 
operator. 

A more detailed analysis and various types of C*-dynamical systems and transfer 
operators will be presented in the next Section 7. 

6.8 Let (C, 5) be a C*-dynamical system and (X, a) be the corresponding dynamical 
system. We denote by M(C) the set of all positive normalized linear functionals on C 
(which take nonnegative values on nonnegative elements and are equal to 1 on the unit). 
Since we are identifying C and C(X), the Riesz theorem implies that the set M(C) can be 
identified with the set of all regular Borel probability measures on X and the identification 
is established by means of the formula 



A((5f)g)=fAg for all f,geC. 



(36) 



A(f + ig)=Af + iAg. 



A(6f) = A((6f)l) =fAl = f for every feC. 
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where fx in the right-hand part is a measure on X assigned to the functional fx G M(C) 
in the left-hand part. That is why with a slight abuse of language we will call elements 
of M(C) measures. 

A measure fx G M{C) is called 5-invariant if for each / G C we have fx{f) = fx{5f). 
The set of all 5-invariant measures from M(C) will be denoted by Mg(C). Clearly, in 
terms of the dynamical system (X, a) the condition fx{f) = fx{5f) is equivalent to the 
condition fx(f) = fx(foa), f G C(X). Therefore M S (C) can be identified with the set of 
all a-invariant Borel probability measures on X. 

6.9 Let A: C — > C be a fixed transfer operator for a C*-dynamical system (C, 5). Just as 
in Section 2 we define the family of operators A v : C — > C, where tp G C, by means of the 
formula A^f := A(e v /). Evidently, all the operators of this family are transfer operators 
for (C, 5) as well. 

In view of the foregoing observations the definitions of the spectral potential \(<f>) 
(equality (14)) and t-entropy (Definition 5.3) can be carried over word by word to the 
C*-dynamical systems case. Here they are: 

\(<p) = lim - In 11^1 II , ^eC; (37) 
r(fx) := inf , T n (fx) := inf T n (fx, D), (38) 

neN n D 

T n (fi,D):= sup J2fx(g)\n m (f 9 > , /i G M$(C). (39) 

The infimum in (39) is taken over all the partitions of unity D in the algebra C. 
Once this is done Theorem 5.6 takes the following form. 

Theorem 6.10 [Variational Principle: C*-formulation] Let (C,6) be a C* -dyna- 
mical system, A : C — > C be a certain transfer operator for (C,6), and A v = A(e v •) 
for all ip G C. Then the corresponding spectral potential \(<p) satisfies the variational 
principle 

\{ip) = max (fx(ip) + r(fx)), ip G C, (40) 

where M${C) is the set of all positive normalized 5-invariant linear functionals on C. 

The proof of Theorem 6.10 exploits a number of special properties of t-entropy. There- 
fore before proceeding to the proof of the Variational Principle itself (which will be given 
in Section 10) we will implement the necessary analysis of t-entropy. The results of this 
analysis are presented in two Sections 8 and 9 where in the first one we describe the an- 
alytic properties of t-entropy while the second one is devoted to its statistical properties. 

The C*-analogue to Theorem 5.8 will be formulated and proved in Section 11. 
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7 Types of C*-dynamical systems 
and transfer operators 

In this section we present a number of types (which we call examples) of C*-dynamical 
systems and transfer operators that show, in particular, how far away from the model 
situation presented in Section 2 one can move. In addition we give a complete description 
of transfer operators. 

Let us start with certain examples of C*-dynamical systems and base algebras. 

Example 7.1 Let Y be a measurable space with a a -algebra 21 and (3 : Y — > Y be a 
measurable mapping. We denote by (Y, f3) the discrete time dynamical system generated 
by the mapping {3 on the phase space Y . Let B be any Banach algebra such that 

a) B consists of bounded real-valued measurable functions on Y , 

b) it is supplied with the uniform norm, 

c) it contains the unit function, and 

d) it is /3-invariant (that is f ' o(3 G B for all f G B). 

Clearly the mapping 5 : B — > B given by 5(f) := fo(3 is an endomorphism of B and 
therefore (B,S) is a C* -dynamical system with the base algebra B. 

Example 7.2 As a particular case of the base algebra in the previous example one can 
take the algebra of all bounded real-valued measurable functions on Y. We will denote 
this algebra by B(Y). 

Example 7.3 Let (Y, 21, m) be a measurable space with a probability measure m, and 
let (3 be a measurable mapping such that m(/3 _1 (G)) < Cm(G), G G 21, where the 
constant C does not depend on G. In this case one can take as a base algebra the 
space L°°(Y, m) of all essentially bounded real- valued measurable functions on Y with the 
essential supremum norm. 

Remark 7.4 1) If, as in Example 7.2, C = B{Y) then the elements of M{C) can be 
naturally identified with finitely-additive probability measures on the a-algebra 21 by 
means of the equality //(/) = j Y f dfi, f G B(Y). 

2) If, as in Example 7.3, C = L°°(Y, m) then M{C) consists of finitely-additive proba- 
bility measures on 21 which are absolutely continuous with respect to m (that is they are 
equal to zero on the sets of zero measure m). 

3) In Example 7.3 the set M a (C) is the subset of M(C) consisting of measures /i such 
that n(f3~ l {G)) = fJ>(G) for each measurable set G. 

4) It should be emphasized that in general given a concrete functional algebra its en- 
domorphism is not necessarily generated by a point mapping of the domain. For example, 
if C = L°°(Y, m) then its endomorphisms are generated by set mappings, that do not 'feel' 
sets of measure zero (see, for example [50], Chapter 2). Thus not every endomorphism of 
L°°(Y,m) is generated by a certain measurable mapping (3 as in Example 7.3. 

On the other hand, as Theorem 6.2 tells, on the maximal ideals level any endomor- 
phism is induced by a certain point mapping. Therefore raising the apparatus of investiga- 
tion to the C*-algebraic level (Definition 6.5) we not only essentially extend the sphere of 
the objects under consideration but additionally can always exploit point mappings in the 
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study of transfer operators independently of their concrete origin (see in this connection 
the general description of transfer operators given below). 

Now let us present certain examples of transfer operators. 

Example 7.5 Let X be a compact space, a : X — > X be a homeomorphism and C = 
C(X). Consider a weighted shift operator A : C(X) — > C(X) given by the formula 

[Af](x) = a(x)f(a- 1 (x)) 1 

where a G C(X) is a certain nonnegative function. Evidently A is a transfer operator. 

The next example can be considered as the main model example for transfer operators 
discussed in the article. 

Example 7.6 Let (Y, 21) be a measurable space with a cr-finite measure m, and let (3 be 
a measurable mapping such that for all measurable sets G G 21 the following estimate 
holds 

m(p-\G)) < Cm(G), 

where the constant C does not depend on G. For example, if the measure m is /5-invariant 
one can set C — 1. Let us consider the space L 1 (Y,m) of real- valued integrable functions 
and the shift operator that takes every function / G L l {Y, m) to f of3. Clearly the norm of 
this operator does not exceed C. The mapping 5f := fof3 acts also on the space L°°(Y, m) 
and it is an endomorphism of this space. As is known, the dual space to L X (Y, m) coincides 
with L°°(Y,m). Define the linear operator A: L°°(Y,m) — > L°°(Y, m) by the identity 

J f ■ gopdm = J^(Af)gdm, g G L\Y,m). 

In other words A is the adjoint operator to the shift operator in m). If one takes as 
g the index functions of measurable sets G CY, then the latter identity takes the form 

/ / dm = / Af dm. 

Jf5- 1 (G) JG 

Therefore Af is nothing else than the Radon-Nikodim density of the additive set function 
pLf(G) = /^-i( G ) f dm. Evidently, the operator A is positive and satisfies the homological 
identity 

A((Sf)g) =fAg, f,geL°°(X,m). 

We see that A is a transfer operator (for the C*-dynamical system (L°° (Y, m) , 5)) . And 
in the case when m is /5-invariant measure it is a conditional expectation operator. 

Transfer operators and positive functionals. By identifying the base algebra C with 
C(X) one can also obtain a certain 'more explicit' description of transfer operators linking 
them with special families of positive functionals. Here it is. 

Let, as usual, X be a compact space, a : X — > X be a continuous mapping, and 
A: C(X) — > C(X) be a certain transfer operator. 
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For every point x G X define the functional <\> x according to the formula 



<f>x(f) : = [Af] (x), feC(X). (41) 

Evidently, <p x is a positive functional. 
There are two possibilities for x. 

1) [Al](x) = 0. This means that (f> x (l) = which implies <p x = due to the positivity 

of (f> x . 

2) 7^ 0. In this case <p x ^ and 4> x defines a certain measure v x on X. 
The homological identity implies also that for any / G C(X) we have 

[A(foa)](x) = [A(fo a .l)](x)=f(x)-Al(x), 

and therefore 

1 

M/ OQ = /0*0» 



Alfa;) 



which means that 



supp v x C a (x). (42) 



Clearly, the mapping x — > <p x is *-weakly continuous on X. 

Observe also that if x ^ a(X) then AL(x) = 0. Indeed, if Al(x) ^ then choosing a 
function / G C(X) such that 

f\a(x) = and f(x) = 1 
and exploiting the homological identity one obtains 

0=j^[A{foa)](x)=f{x) = l 

thus arriving at a contradiction. 

The objects presented above in fact give a complete description of transfer operators 
in C(X) since one can easily verify that every *- weakly continuous mapping x i— > <f> Xj 
where (f> x are positive functionals such that 

a) <p x = 0, x a(X), 

b) <p x satisfies (42) for x G a(X) (here X may be as well) 

defines a certain transfer operator A: C(X) — > C(X) acting according to formula (41). 

Remark 7.7 1) If a: X — > X is a homeomorphism then the foregoing discussion shows 
that any transfer operator A: C(X) — > C(X) is a weighted shift operator mentioned in 
Example 7.5. 

2) In general for a continuous mapping a : X — > X even a conditional expectation 
in C(X) (if it exists) is not defined in a unique way. 

For example, consider X = T 1 = R/Z and the mapping a : X — > X defined by the 
formula a(x) = 2a; (mod 1). Take any continuous function p on X having the properties 

0<p(x)<l, + J ) +p(x) = 1, xGX, 
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Evidently, the operator 



M(.):= E /(^)-/(|)p(f) + /(^)p(^) 

is a conditional expectation operator for (X,a). 

3) Recalling Remark 7.4 4) we have to stress that in general given a concrete func- 
tional algebra and its endomorphism then a transfer operator is not necessarily associated 
with a point mapping of the domain. 



8 Properties of t-entropy 

In this section we prove a number of properties of t-entropy and in particular its upper 
semi continuity. 

Let us start with a simple but important observation. 

Proposition 8.1 The functional r(/i) satisfies the inequality r(/i) < A(0). 

Proof. Since logarithm is a concave function it follows that for any partition of unity D 
in (39) we have 

T n (n,D)= sup VV(g)ln m( ^ ^ < sup lnVmfA^) 

meM(C)^ M m eM(C) ~ 

= sup \nm(A n l) =m||A n ||. 

mgM(C) 

Which implies the desired inequality. □ 

Remark 8.2 In particular, if A is a conditional expectation operator (i.e. Al = 1) then 
\\A n \\ = 1 and therefore A(0) = 0. Thus in this case r(/x) < 0. 

Proposition 8.3 If A : C C is an invertible conditional expectation operator then 
r(/i) = for any /i G M(C). 

Proof. Since A is a conditional expectation operator we have A(8f) — f, f e C, and 
hence A -1 / = Sf. This implies in particular that A^ 1 : C — > C is a positive mapping. For 
any /i G M(C) and n£N define the functional m by the formula 

m(/):=M^" n (/)), feC. 
Clearly, m G M(C). For this m we have 

^ m(A n c/) 

^ M " )ln ^T = ' 

and therefore r n (/i, -D) > and t(/i) > 0. Combining the latter inequality with Remark 8.2 
we obtain r(/i) = 0. □ 
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Proposition 8.4 The junctionals T n (/i,D) and r(/i) depend concavely on /i G M(C). 

Proof. Let //i,//2 £ M(C) and /x = pi/ii + P2H2, where pi +p 2 = 1 and pi,P2 > 0. Then 
for any element g of a partition of unity D we have 

mi(A n #) m 2 (A n g) [p^ + p 2 m 2 ](A n g) 

ViK9) 1*2(9) M#) 

Let us sum this inequality over g E D and pass to the supremums with respect to m\ and 
m 2 . As a result we obtain the inequality 

PlT„(/Ji, -D) + P2T n (fX 2 , D) < r n (n, D). 

It means that the functional r n (/i, D) is concave with respect to /1. This along with (28) 
implies the concavity of t(/i). □ 

Let D = {<7i, . . . ,gk} be a partition of unity in a base algebra C. We eliminate from 
this partition the elements gi such that A n gi = and put D' n := {g £ D \ A n g 7^ 0}. 
Denote by M(D) and M(D f n ) the sets of all probability measures on finite sets D and D' n , 
respectively. Clearly, these M(D) and M(D f n ) are finite-dimensional simplexes. If one 
extends each measure \x G M(D' n ) by zero to D \ D' n) then the simplex M(D' n ) becomes a 
certain face of M(D). Note that formula (39) defines the functions r n (fj,,D) not only for 
measures \i G M(C) but also for measures \i G M(D). 

Proposition 8.5 The function r„( • ,D) is continuous on the set M(D' n ) and it is equal 
to -00 on M(D)\M(D' n ). 

Proof. Let us take a measure wl G M(C) such that m'(A n g) > for all g G D' n . Fix a 
(small) positive number e. For every measure m G M(C) consider the new measure 

m £ := (1 — e)m + em'. (43) 

Then for all g E D' n the following inequalities hold 

m £ (A n g) > em\A n g) > 

and, on the other hand, 

m £ (A n 9)<U n l\\. 
These inequalities imply the existence of a (large) number C(e) such that 

\\nm £ (A n g)\ < C(e) for all m G M(C), 9 G^. 

Let us introduce the notation 

^(A*,m) := v(9)^m(A n g), (44) 

ip £ (li) := sup if)(n,m e ). (45) 

meM(C) 
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Then for all measures fx, v e M(D' n ) we have 



m £ ) - ij){y, m e ) \ < C{e) \n(g) - u(g) \ . 



Therefore the function ip e (fj) depends continuously on /i e M(D' n ). 
Note that (43) and (45) imply 



(46) 



On the other hand since m £ > (1 



e)m we have 



^ e (/x) > ^ (A*) + ln(l-e). 



(47) 



In view of (46) and (47) the function t/>o (/-*■) is the uniform limit of the functions tp £ (fj) as 
e — > 0. Thus it is continuous as well. Finally, the difference of two continuous functions 



coincides with r n (/x, D) and so it is continuous on the set M(D' n ). 
The second part of the proposition follows from (29). □ 

Proposition 8.6 The functional r(/x) is upper semicontinuous on M{C). 

Proof. By Proposition 8.5 the function r„(/i, D) is upper semicontinuous (with respect to 
*-weak topology) on M{C). Therefore, t-entropy 



also possesses this property. □ 

The definition of r(fi) is rather complicated and its reduction to a simpler form in 
general is problematic. Let us present an example when the corresponding calculation 
leads us to simpler expressions. 

It is also worth remarking that the explicit calculation of r(/i) for a concrete measure 
gives us a lower estimate for the spectral potential. 

Example 8.7 Throughout this example we confine ourselves to the case of Perron-Fro- 
benius operator (3) when the preimage of every point consists of finite number of points. 

First, let x be a fixed point of the mapping a. Then the measure \i = 5 XQ is a 
probability invariant measure. 

Let D = {go,gi, . . . ,gk} be a partition of unity. Then fi(gi) = gi{xo). Since in (28) 
we take the infimum over all partitions of unity it follows that one has to consider the 
partitions for which ^o(^o) = 1 an d gi(x ) = for i ^ 0. Then the sum in expression (29) 
for T n (/i, D) reduces to a single summand 




t(h) = inf T n (n,D)/n. 



n,D 



T n (n,D) = sup \nm(A n g ). 



(48) 



meM 
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For m — 11 — S XQ we have m(A n g ) = A n g (x ). By assumption the operator A acts 
according to formula 

A g( x ) = Yl ^(y)d(y), 

where ip G C(X) is a fixed nonnegative function, and thus 



A»g(x )= Yl 

y€a~ n (xo) 



n-1 

n^K'(y)) 

.3=0 



g{y)- 



Given n and x , one can choose a partition of unity such that go(y) = for y e a n (x ), 
y 7^ Xq. Then A n g (^o) = i ) { x o) n an d therefore 

Tn{n,D) > nln^(xo) 

and 

r(/i) > ln^(x ). 

One can find an upper estimate for the supremum in (48) by using the following rea- 
soning. Given n and e > one can choose a function g satisfying the conditions men- 
tioned above and having the support in a sufficiently small neighborhood of the point xq. 
Then A n go(x) < A n go(xo) + e and it follows that for any probability measure m one has 
m(A n g ) < A n go(xo) + e. Thus in the situation considered we have 

t(S X0 ) = lnip(x ). 

Now suppose xo is a periodic point with the period N, that is a N (xo) = x$. Then the 
measure 

N-1 



3=0 

is an invariant probability measure. Applying the foregoing reasoning to the operator A N 
we obtain 

1 

r(6 XOtN ) = -lnH^(x )). (49) 

3=0 

In the wavelet theory there arises the so-called subdivision operator, which is defined in 
the following way. Let T m = R m /Z m be the m-dimensional torus. We consider a matrix M 
with integer elements such that the absolute values of all its eigenvalues are greater than 1. 
It generates the mapping a: T m — > T m according to the formula a(x) = Mx (mod Z m ). 
This mapping preserves the Lebesgue measure and it is an expanding map, and the 
number of preimages of every point is | detM|. 

The subdivision operator acts on the space L 2 (T m ) according to the formula 

S^u(x) = a(x)u(a(x)) , 

where a is a fixed function. It is a weighted shift operator generated by irreversible 
mapping. This operator is exploited in the iteration procedure of the construction of 
the wavelets base. The information on its spectral radius is required for the convergence 
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guarantee of the procedure mentioned. As we have already observed in Introduction the 
spectral radii of weighted shift and transfer operators are tightly related (cf. (9)). Their 
interrelation will be discussed further in Section 13. 

In the paper by Didenko [24] a series of lower estimates for the spectral radius of the 
subdivision operator was obtained. In fact, some of these estimates has been derived with 
the help of the usage of periodic points of the mapping a. 

We dwell on this in such detail to emphasize the fact that such estimates follow 
directly from (49). To be true, we have to note that in [24] there was considered a more 
complicated situation as well, namely the subdivision operators with matrix coefficients 
in the space of vector functions. The whole of the theory presented in this article is not 
applicable readily to these operators. 

9 Entropy Statistic Theorem 

In this section we prove a certain theorem on the statistic of distribution of empirical 
measures. This result is important in its own right and plays for r(/x) the role similar 
to that the Shannon-McMillan-Breiman theorem plays for h(fj,) (see, for example [20], 
Chapter 4). The Entropy Statistic Theorem (Theorem 9.1) not only uncovers the statis- 
tical nature of r(/x) but also serves as the main technical instrument in the proof of the 
Variational Principle. 

Consider a dynamical system (X, a) , where a : X — > X is a continuous mapping of a 
Hausdorff compact space X. Recall that by M we denote the set of all Borel probability 
measures on X. Let x be an arbitrary point of X. The empirical measures 5 x . n G M are 
defined by the formula 

M/) = ^ (/(*) + /(«(*))+ ••• +f{a n -\x))), feC(X). (50) 

Evidently, the measure 5 XjTl is concentrated on the trajectory of the point x of length n. 

We endow the set M with the *-weak topology of the dual space to C(X). Given 
a measure /i e M and its certain neighborhood O(fx) we define the sequence of sets 
X n (0(fi)) as follows: 

X n (0(fi)):={xeX\5 x , n eO(fi)}. (51) 

Theorem 9.1 [Entropy Statistic Theorem] Let (X,a) be a dynamical system and 
A : C(X) — > C(X) be a certain transfer operator for (X,a). Then for any measure 
/i e M and any number t > r(/i) there exist a neighborhood 0(/i) in the *-weak topology, 
a (large enough) number C(t,fi) and a sequence of functions Xn £ C(X) majorizing the 
index functions of the sets X n (0(fj,)) such that for all n the following estimate holds 

\\A n Xn \\<C(t,fi)e nt . 

To prove this theorem we need a number of auxiliary results and we start with their 
consideration. 

Let us fix a natural number n and a partition of unity D in C(X). Let, as above, D' n = 
{ g e D | A n g ^0}, and the symbols M(D) and M(D' n ) denote the finite dimensional 
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simplexes consisting of all probability measures on D and D' n , respectively. As it was 
observed the simplex M(D' n ) is naturally embedded in M(D): the measures from M(D' n ) 
can be extended onto D \ D' n by zero. Given fi G M(D' n ) there exists a measure m M G M 
at which the supremum in (29) is attained. In other words, 



r n (fa -D) = sup £ /i(#) In ^ff = £ In '^LMl. (52) 



In general the correspondence /i >— > m M may be not single- valued. However, for convenience 
of presentation we will assign to every measure \i G M(D' n ) a certain single measure 
m M G M satisfying the equality (52), and thus we will assume that there is fixed a single- 
valued mapping fj, i— > m M . 

Lemma 9.2 Given a single-valued mapping \i \— > m M , satisfying equality (52), every 
summand (j J (g)\n(m IM (A n g) j fa[g)) in the right-hand part of (52) is a bounded function 
with respect to /i G M(D' n ) and tends to zero when /i(g) — > 0. 

Proof. First, note that m ll (A n g) < ||A n l||. Hence the expression fj,(g)\n(m^(A n g) / fx(g)^j 
is bounded from above, and its sup lim as /x(g) — > is nonpositive. By Proposition 8.5 the 
function r n (/j,,D) depends continuously on fi G M(D' n ). Thus, it is bounded on M(D' n ). 
So, as all the summands in the right-hand part of (52) are bounded from above, the 
forgoing reasoning shows that they are bounded from below as well. 

Now we finish the remaining part of the proof arguing by contradiction. Let go G D' n . 
Suppose that the value of the expression 

M#o) In * rr 

does not tend to zero as fa[go) — > 0. Then there exists a sequence of measures fa G M(D' n ) 
and a number e > such that 

m(go) - (53) 



and at the same time 



ft Wln^^<- £ . (54) 
Wo) J 



Without loss of generality one can assume that the sequence fa tends to a certain measure 
v G M(D' n ). The corresponding sequence of measures m Mi G M has at least one limit 
point m G M. Passing to subsequences one can gain for every g G D' n the equality 

lim m^{A n g) = m{A n g). 

Let us consider any function g G D' n . Obviously, if v{g) > 0, then 

lim fa(g) \nm^{A n g) = v{g) \nm(A n g); 

if u(g) = 0, then, as it has been already observed, 

lim sup fa(g) In m IXi (A n g) < 0; 
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and if g = go, then (54) and /^(<?o) — > imply 

limsup^(g-o) \nm^(A n g ) < -e. 

Observe also that since u(go) = it follows that the corresponding summand in (29) for 
T n (v, D) is zero. Now the foregoing relations and this observation imply 

/ x , / m m u (A n g) sr^ , \^ m{A n g) 
hm sup T n (fn, D) = hm sup > «;(a)m — ' < } v(g) In — e. 

In view of (52) the right-hand part of the latter inequality does not exceed r n {y, D) — e, 
which contradicts the continuity of the restriction of r n ( • , D) to M(D' n ). □ 

Denote by Int M(D' n ) the set of measures /i G M(D' n ) that take positive values on all 
elements g G D' n . Let /i G lntM(D' n ) and v G M(D). Put 

r n (u,n,D):= ^(a)m^T^ (55) 
where the measure 6 Mis defined by equality (52). 

Lemma 9.3 Given a single-valued correspondence fi i— > i/iai satisfies (52), then for 
every measure /i G M(_D[J and any number t > r n (fi ,D) there exists a neighborhood 
0(/io) in the set M(D) and a measure jj, G 0(/io)nlnt M(D' n ) such that for all v G 0(/io) 
the following estimate holds: T n (u,fi,D) < t. 

Proof. Let D' n = {g 1: . . . ,gt}, and /ii be the center of the simplex M(D' n ). It is defined 
by the equalities 

Vl(9i) = l / k for a11 9i e D 'n- 

Let us consider the family 

He = (1 - 0Vo + 0A*i- 

For small positive 6 this family belongs to Int M(D' n ). Denote by Oq{ho) the set of 
measures v G M(D) satisfying the inequalities 

W(.9i) ~ M9i)\ < for a11 9i e ^n- 

Clearly 

/ie G Oo(no)- 

By Proposition 8.5 the function r n (fig,D) depends continuously on the parameter > 
and when 8 is small T n (fig,D) differs in a small way from r n (fio,D). Therefore, it suffice 
to prove that if v G Og(fi ) then the difference 



T n (u, fl 6 , D) - T n (flg, D)=Y] ( ~~T~T ~ 1 J X ln 



mjj^9) 



(56) 



tends uniformly to zero as # — > 0. This can be derived in a simple way from Lemma 9.2. 
Indeed, if Ho(g) > 0, then for small the corresponding summand in (56) has small first 
factor and bounded second one; and if fio(g) = then the first factor is bounded while 
the second one is small. □ 
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Lemma 9.4 If (J, & hit M(D' n ) and a measure m M G M satisfies equality (52), then 

Proof. Lemma 9.2 implies that m^(A n g) > whenever > 0. Let us take any measure 
m G M and consider the function 

/ x, (l-t)m u (A n g)+tm(A n g) . . 

^(t) = M ln , * e [0, l]. 

By (52) this function attains the maximal value at t = 0. Therefore its derivative at zero 
is nonpositive: 

Since the measure m G M is arbitrary this implies the statement of the lemma. □ 

Lemma 9.5 If a function g G C(X) is nonnegative and A n g = 0, then for every positive 
integer N the following equality holds true 

A N+n (e SN9 ) =A N+n l, 

where S^g is defined by (21). 

Proof. Evidently, the function A t+n (goa 1 ) is nonnegative. On the other hand the homo- 
logical identity implies 

A i+n (goa l ) = A n (gA i l) < A n g ■ = 0. 
Hence A l+n (goa % ) = and thus 

N 

A N+n {S N g) = ^^oa*- 1 ) = 0. (57) 
i=i 

Recall now that the function S^g is nonnegative and bounded. By the Lagrange mean 
value theorem we have 

e SN9 _ j < e W S N9\\S N g. 

Applying the operator A N+n to this inequality and exploiting (57)we obtain 

A N + n ( e SN9 ) - A N+n l < e^ s ^A N+n {S N g) = 0. 
On the other hand, e Sn9 > 1, and therefore 

A N+n (e Sn9 )>A N+n l. □ 



Now we can prove the Entropy Statistic Theorem itself. 
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The proof of the Entropy Statistic Theorem. Let us fix a measure /x G M and 
an arbitrary number t > r(/i ). Choose a natural number n and a partition of unity D 
in C(X), satisfying the inequality 

r n (/i , £>) < nt. 

as above, we denote by D' n the union of functions g <E D such that A n g ^ 0. 

Suppose first that fi £ M(D' n ). In this case there exists a function g E D\D' n satisfy- 
ing the inequality Ho(g) > 0. Take the number e = /io(g)/2 and define the neighborhood 

O( t i ):={ t ieM\ t i(g)>e}. (58) 

Let x G X n (O(ii )) for some N, where X n (O(ii )) is that defined by (51). Then the 
empirical measure S x> n belongs to 0(/iq) and hence 

S N g(x) = N8 x , N (g) > Ne. 

Define the function xn by the formula 

where C is arbitrary positive. Clearly, xn majorizes the index function of the set 
X N (O(fj, )). If in addition N > n, then 

XN < e C{SN - n9+n ~ Ne) = e Cn-CNe e S N _ n (Cg)_ 

This along with Lemma 9.5 implies 

A N X N = e Cn ~ Cm A N l < e Cn e N(ln\\A\\-Ce)^ (5Q) 

If we choose C so large that In \\A\\ — Ce < t, then (59) implies the Entropy Statistic 
Theorem. 

To complete the proof it remains to consider the situation when /io G M(D' n ). By 
Lemma 9.3 in this case there exists a neighborhood O(/i ) C M(D) and a measure ji G 
O(fx ) n Int M(D' n ) such that for every v G O(fi ) we have 

r n (v,fx,D) < nt. 

Define the function 

^(x) := £ g(x) In + £ <?(*) In 1, (60) 

where the measure m M G M satisfies equality (52). Note that the second sum in (60) is 
equal to zero. Therefore by comparing (60) and (55) we see that 

S N ^(x) = -NT n (S x , N , f jL,D). (61) 

On the other hand the convexity of the exponent implies 
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Applying the operator A n to (62) and exploiting Lemma 9.4 we obtain the estimate 

A n (e*) < 1. (63) 

Let us introduce the notation 

= ^ + f a " H h %l)oa n{k ~ l) . 

The homological identity and estimate (63) imply 

A nk (e^) = A n (e*A n (e*...A n (e*A n {e*))..)) < 1. (64) 

By construction the function ip(x) has a finite norm ||^||. Consider an integer N > n 
and take a natural k such that N e [n(k + 1), n(k + 2)]. Then 

n-1 

< S nk i) + 2n HVII = J2^ oat + 2n II^H • ( 65 ) 

i=0 

If x E Xn(0(/io)) then the empirical measure 5 Xt N belongs to 0(/io) (see (51)) and by the 
choice of this neighborhood, 

T n {&x,N,IJ>,D) < nt. 
In this case (61) along with (65) imply 

N 11 n_1 

Nt > -r n (5 x>N ,ti,D) = —S N iP(x) > — J2i> k oa l (x) - 2 W 
n n n 

for every x G Xn(0(/iq)). Therefore, the function xn defined by the formula 

X n(x) :=exp\Nt + 2\\^\\ + -J2^ k oa t (x)\ 
< n »=o > 

majorizes the index function of the set X n (O(ii )). 

Recall the Holder inequality: if m is a linear positive functional on C(X) and fi, . . . , 
f n are nonnegative functions from C(X) then 

m (/ 1 .../„)<(m(/f)...m(/;)) 1/B 

Applying this inequality to the functional m(f) = [A N f~\(x) and the function / = xn, 
we obtain the estimate 

n— 1 j, 

[A^jOr) < e^H^II J] [(^e^ " 4 )^)] . (66) 

i=0 

In addition, the homological identity and (64) imply 

A N^ k oa^ = A N-i-nk A nk^ kA i-^ < \\ A \\ N ~ nk A ^ e ^ < pH^^. (67) 

Estimates (66) and (67) imply the Entropy Statistic Theorem. □ 

Now, by means of this theorem, we can prove the Variational Principle. 
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10 The proof of the Variational Principle 

By Theorem 5.5 we have already established that 

A(y?)> sup (//(</?) +r(//)), (68) 
»eM s (C) 

and therefore we have to prove the opposite inequality. Its proof needs some auxiliary 
results. 

Proposition 10.1 The sets M(C) and M S (C) are compact in * -weak topology. 

Proof. These sets are closed subsets of the unit ball in the dual space to C and by the 
Alaoglu theorem the latter ball is compact. □ 

Proposition 10.2 Let (X, a) be the dynamical system corresponding to a C* -dynamical 
system (C, 5) and C be identified with C(X) (see 6.1, 6.4). If U is a neighborhood of the 
set M$(C) in M{C) then there exists a (large) N such that for all n > N and x G X one 
has 5 x>n G U , where 5 X;7l is the empirical measure defined by (50). 

Proof is implemented by contradiction. Suppose that there exist sequences x k G X and 
nk — > oo such that S Xkiflk 4- U- By the compactness of M{C) the sequence S XktTlk possesses 
a limit point 

H G M(C) \ U. (69) 
Clearly for every / G C the following equality holds 

S Xk , nk (f) ~ S Xk , nk (foa) = n k l (f(x k ) - f(a nk (x k )). 
As a limit one has //(/) = /i(foa) and so fi G Ms(C), which contradicts (69). □ 

Now we can finish the proof of the Variational Principle. 

Hereafter we exploit the identification C = C(X) explained in 6.1, 6.4. 

Consider the functional 

A(y?):= sup (^)+r(^)). (70) 

First we prove the equality 

\(<p)=A(<p), ipeC. (71) 

Choose arbitrary numbers c > A(</?) and e > 0. On the set of invariant measures Ms(C) 
we define the functional 

t(n) :=c-n((p). 

Clearly, 

t(n) > r(/i). 

For every measure fi G M$(C) we choose a (small) neighborhood 0(/x) in M{C) such that 
it satisfies the Entropy Statistic Theorem (with the number t = t(/i)) and, in addition, 
for all v G 0{p) the following inequality holds: 

u((p) < fi((p)+e. 
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Then for every point x G X n (0((j,)) we have 

S n ip(x) = n5 X!n (ip) < n(n(<p) + e) = n(c - t(/i) + e). (72) 

By Proposition 10.1 the set Mg(C) is compact. Let us cover it by a family of neighborhoods 
of the type mentioned, and let 0(/ii), . . . , O(fik) be a finite subcover consisting of the 
elements of this family. The Entropy Statistic Theorem assigns to each neighborhood 
0(/ii) a sequence of functions Xi,n majorizing the index functions of the sets X n {0{jj„i)). 
Proposition 10.2 implies that for all sufficiently large n every point x G X lies in a certain 
set X n (0(fXi)), % — 1, . . . , k. Hence, 

Xl,n + ■■■ + Xk,n > 1 

Now (72) along with the Entropy Statistic Theorem imply the estimates 

k k 

A%\ = A n (e s ^l) < ^ n (e" (c-t(w)+£) XvO ^ Yl e n{c - t{ » i)+e) C (t(m) , ^e 71 ^ . 

i=i i=i 

As n — > oo these estimates imply 

A(y?) < c + e. 

Since the numbers c > A(ip) and e > are arbitrary the following inequality holds 

AM <A(<p). 

In view of (68), (70) this implies (71). 

Finally, to complete the proof of (40) we recall that by Proposition 8.6 t-entropy r(/x) 
is upper semicontinuous on the compact space M$(C). Therefore the supremum in (70) is 
in fact maximum and thus (40) is proved. □ 



11 Variational Principle for transfer operators 
with nonnegative weights 

In this section we prove a C*-algebraic version of Theorem 5.8. 

Let (C, 5) be a C*-dynamical system and let (X, a) be the corresponding dynamical 
system. Let A : C — > C be a fixed transfer operator for (C,5). We define the family of 
operators At/; : C — > C, where ip G C, as 

Aij):=A(il)-). (73) 

Evidently, if ip > then is a transfer operator. 

Remark 11.1 1) If ^ > then 

= A lnV ,, 

where, as above, A v = A(e lp ■ ). 

2) Given a transfer operator A : C — > C one can also consider the family of operators 
■0^4 for ip £ C. The homological identity (36) implies the equality -^A = A5(ip). Thus this 
family is a subfamily of the one considered in (73). 
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Theorem 11.2 [Variational Principle for transfer operators with nonnegative 
weights] Let (C, 5) be a C* -dynamical system and (X, a) be the corresponding dynamical 
system. Let A: C — > C be a certain transfer operator for (C, 5), Aip be a transfer operator 
defined in (73), where tp G C and ip > 0, and £(ip) be the logarithm of the spectral radius 
of Aip. Then the following variational principle holds true 

eM>)= max ( [ \n^dfi + r(fi)) (74) 
iieM s (C) \J X J 

(where in the notation we exploit the identification of C with the functional space C (X) 
(see 6.1) and, in particular, we treat In ip as a function on X, and we also identify the 
functionals \x G Ms(C) with a-invariant Borel probability measures on X (see 6.8)). 

Proof. Theorem 6.10 implies (74) for any positive function ip G C = C(X). So we have 
only to consider the case when ip has zero points. 

Let ip n := ip + 1/n. Evidently, the sequence ip n > is monotonically decreasing and 
tends in norm to ip. The definition of £(ip) implies that 

This observation along with upper semicontinuity of the spectral radius implies the exis- 
tence of the limit 

hm^ n )=^). (75) 

The functions ip n satisfy (74). Hence for each \i G M a (C) we have 

Z{ip n ) >//(lnV„)+r(/i). (76) 
By the choice of ip n and Levi's monotonic limit theorem we have 

lim (J,(lnip n ) = fxQjatp), 

which along with (75) and (76) means that 

£(if>) > //(In ^) + r(/i), » G M a (C). (77) 

Now let ii n G M S (C) be a measure such that 

£(ip n ) = (j, n (lnip n ) + r(/i„), (78) 

and let fiQ be a limit point of the sequence {//„}. Evidently, G Mg(C). 
Since ip k > ip n for k < n, equality (78) implies also that 

t(lpn) < VnQ-nipk) + T(Hn), k < H. 

This inequality along with (75) and the upper semicontinuity of r(/i) (Proposition 8.5) 
implies 

< ^o(ln^fc) + t(hq). 

Finally, passing to the limit and applying Levi's monotonic limit theorem, we get 

e(i/>) </i (ln^) + r(/i ). 
Which along with (77) proves (74). □ 
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12 Additional properties and alternative 
definition of t-entropy 

The Variational Principle tells us in particular that t-entropy plays the principal role in 
the spectral analysis of transfer operator and this section is devoted to the description of 
a number of additional properties of t-entropy that have not been exploited in the proof 
of the Variational Principle. Along with this we will also present a certain alternative 
definition of t-entropy. 

Proposition 12.1 If n e M S (C) then 

r n +M < r n (n) + r k (fi). (79) 

Therefore the t-entropy of a 5-invariant measure \i can be defined as the limit 

r(„) = hm ^M. 

Proof. Let us consider two partitions of unity D, E in the algebra C. For every two 
elements g G D and h G E we define the function 

fgh ■■= ghoa k . 

Consider the new partition of unity 

C:={f gh \gED, hEE}, 



and introduce the notation 



and 



D li :={geD\ t i(g)>0}, 



E >1 :={heE \ fi(h) > 0}. 

Suppose first that A n h = for a certain h G E^. Then by the homological identity we 
have 

A n+k f gh = A n (hA k g) = 0. 

Therefore, 

r n (/i, E) = r n+k (ii, C) = -oo 

and the proposition is proved. 

Now let all the elements A n h, where h G E^, be nonzero. Then there exist measures 
m G M(C), satisfying the condition m(A n h) > for all h G E^. These measures by 
the 5-invariance of the measure \i and concaveness of the logarithm function satisfy the 
relations 

\- ( , u m(A n+k f 9h ) ^ m{A n h) 

^ " {hh)Xn ^AU) ^^ )ln ^T 

f gh ec r\Jgh) heE/i ) 
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-k M, \W)k ) 



= > n(g)]n — -t-^ 1 , where m'( • ) = > — ^ — ) - ' f v y . 

One can easily verify that m' G M(C). By varying the measure m in the latter relation 
we obtain the inequality 

T n+k (n, C) < r n (n, E) + Tk(n, D), 

which implies (79). □ 

12.2 For any partition of unity D in the algebra C(X) and any positive number e we 
denote by W(D,e) the set of all partitions of unity E in C(X), satisfying the following 
condition: if g G D and some function h G E does not vanish at points x±,x 2 £ X then 
\g(xi) — g(x 2 )\ < e. In other words, each function g G D has small oscillation on the 
support of any h G E. 

Proposition 12.3 Given a measure fi G M(C(X)) and a partition of unity D in C(X), 
and a real number t > r„(/i, D), there exists a (small) e > such that for every partition 
of unity E G W(D,e) the estimate T n (fx,E) < t holds true. 

Proof. Let 

D li :={geD\ fJ i(g)>0}. 

Take a (small) positive number e such that for every g G the following inequality 
holds 

Consider any partition of unity E G W(D,e), where W(D,e) is defined in 12.2. Let 

:= {h G E | /i(h) > 0}. 
Then for every measure m G M(C(X)) the concavity of the logarithm implies that 



< 

If <? G -D^ and /i G E^, then the definition of W(D,e) implies 

H{h) 
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<8i) 



Substituting this inequality into (81) we obtain 



m(A n h) . m(A n (g + e)) , nn . 

S^ )h icT s £'* ,,B -Sfe Ji - (82) 

Now it suffice to check that for small e the sum in the right-hand part of (82) does 
not exceed t. Indeed, if it is so then passing to the supremum over m G M(C(X)) we 
obtain the estimate r n (/x, E) < t we are looking for. 

If there exists a function g G such that m(A n go) < y/e, then evidently 

m(A n (g + e))< y /i + e\\A\\ n , 

and the whole of the sum (82) does not exceed t in view of (80). 

In the opposite case for every function g G the following estimate holds 

m(A n (g + e))<m(A n g)(l + Vi\\A\\ n ). 

It implies that the sum in the right-hand part of (82) does not exceed 

T n ( f i,D) + ]n(l + y /i\\A\\ n ). 

And the latter expression is less than t whenever e is small enough. □ 

Recalling the identification of C with C(X) we are giving now an alternative defini- 
tion of t-entropy. Let (C, S) be a C*-dynamical system and (X, a) be the corresponding 
dynamical system. Let A be a transfer operator in C(X) (we identify C with C(X)) and 
let A* : C*(X) -> C*(X) be the adjoint operator to A. The formulae (28), (29) can be 
rewritten in the following way: 

T ^) :=iB £ljM i T n (fj) :— infr n (//, D), (83) 

n n D 

, / x , A* n m(q) , ^ 

r n {n,D):= sup Y>(<?)ln . (84) 

meM(C(X)) ^ M 

Here D denotes a continuous partition of unity on X. 

In fact these formulae have sense for arbitrary measurable partitions of unity as well. 

Let us consider any measure \i G M{C(X)). A finite Borel partition {Gi, . . . , Gk} of 
the space X will be called fi-proper, if the boundary of each set Gi has zero measure \i. 

A Borel partition of unity D = {g ± , . . . ,g k } on X will be called fi-proper if it consists 
of index functions of sets Gi C X forming a /i-proper partition of X. 

Proposition 12.4 Given an open cover W of a Hausdorff compact space X and a mea- 
sure /i G M{C(X)), there exists a finite fi-proper partition of X refined in W . 

Proof. Without loss of generality one can assume that the open cover W is finite. Let 
us consider the case when it consists of two sets U , U±. Suppose that the closed sets 
U i = X\Ui are nonempty (in the opposite case the statement is trivial). By the Urysohn's 
Lemma there exists a continuous function / : X — > [0, 1] vanishing on Uq and equal to 
unit on U\. By the finiteness of the measure \i there exists a number c G (0,1) such 
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that /i(/~ 1 (c)) = 0. Now one can take the partition of the space X consisting of two 
sets Go = / _1 ([0, c)) and G\ = / _1 ([c, 1]). In the general case one should apply the 
induction by the number of elements of the cover and use on every step the construction 
described. □ 

Proposition 12.5 The values of the junctionals r(/i) and r n (/i), where /i G M{C(X)), 
do not change if in formulae (83) and (84) one uses \i-proper partitions of unity instead 
of continuous partitions of unity D . 

Proof. Note first that Proposition 12.3 holds true if in this proposition one considers 
arbitrary Borel partitions of unity instead of continuous ones and defines the functional 
T n (/i,D) by means of formula (84). Its proof stays the same, one should simply replace 
everywhere m(A n ■ ) by A* n m( ■ ). 
Define the functional 

r M ■= infr n (/i,L>), 

where the infimum is taken over all /i-proper partitions of unity D. Propositions 12.4 and 
12.3 (for the Borel partitions) imply 

tM < r n (fi). 

So it suffice to prove the opposite inequality. 

Fix a number e > 0. Choose a /z-proper partition of unity E — {hi, . . . , h^) such that 

T n (fi,E) < rM+e. 

By definition the sets Vi = h^il) form a /z-proper partition of X. Denote by Y the union 
of the boundaries of these sets. Clearly, jj,(T) = 0. Let 5 £ be a positive and small enough 
number (it will be indicated below how small it should be). By the regularity of the 
measure \i the set T possesses a neighborhood 0(T) such that 

MO(H) < s e . 

Let us consider the open cover of X consisting of the sets 

o(r),vi\r,...,y fc \r 

and choose a continuous partition of unity D = {go,gi, ■ ■ ■ ,gk} refined in this partition 
(in such a way that suppgo C 0(T) and supp gi CK \ T). Then 

9i<hi<gi + g , % = 1, . . . , k. 

In addition, 

»(g ) < /i(0(r)) < 5 £ . 

Therefore, 

\fi(gi) - fi(hi)\ < 5 e , i = l,...,k. (85) 

Let E' n be the set of all the functions hi G E for which there exists a measure m' G 
M(C(X)) such that A* n m'(hi) > 0. Fix a measure m' G M(C(X)) satisfying the condition 

A* n m\hi) > for all hi G E' n . 
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Let m G M(C(X)). Consider the measure 



m £ — (1 — e)m + em! . 



It satisfies the inequalities 



k 



m(A n gi) 



k 



m e (A n gj) 



ln(l - e) + J^(ft) h 



i=0 



i=0 




k 



A* n m £ (hj) 



(86) 



i=i 



It suffice to check that the whole of the sum (86) does not exceed T n (/i, E) + e. Since, 
if it is so, then taking the supremum over m G M(C(X)) one obtains the estimate 



The first summand in (86) depends continuously on the value of n(go). Since /i(go) < 
/i(0(r)) < 5 £ , this summand can be made arbitrarily small by the choice of 5 £ . Defini- 
tion (84) implies that if one replaces in the second summand in (86) all the functions g± 
by then he obtains the expression not exceeding r„(/i, E). Therefore it suffice to check 
that the second summand in (86) changes in a small way under this replacement. 

Let us consider each summand in the right-hand sum of (86) separately. 

If hi G E' n then by the construction one has 



These two inequalities imply that the corresponding summand in (86) depends continu- 
ously on the value fx{gi) and therefore will change in a small way on the replacement of 
fx(gi) by fx(hi) (in view of (85)). 

If hi E' n and concurrently fi{hi) > 0, than one has > (again in view of (85)). 

So in this case the corresponding summand in (86) and the whole of the sum (86) are 
equal to — oo. 

Finally, if hi ^ E' n and fi(hi) = 0, then fi(gi) < fi(hi) = and the corresponding 
summand in (86) is equal to zero. As a result we have that the whole of the sum (86) 
does not exceed r„(/i, E) + e as soon as the number 5 e in (85) is sufficiently small. □ 

This proposition naturally leads to the following 

Definition 12.6 Let (C, S) be a C*-dynamical system and (X, a) be the corresponding 
dynamical system. Let A be a transfer operator in C(X) (we identify C with C(X)) and 
let A* : C*(X) — > C*(X) be the adjoint operator to A. T -entropy is the functional r on 



ln(l - e) + Tn (ji, D) < Tn (ji, E)+e< t'M + 2s, 



and by the arbitrariness of e it implies that 



r n (fi) < <(//). 



A* n m £ (hi) > eA* n m\hi) > 0, 



and on the other hand 



A* n m £ (hi) < \\A\\ 
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the set M a (of all a-invariant Borel probability measures on X) defined by the following 
formulae 

rfr) := inf T -M T n (/i) := inf r„(/i, D), (87) 

n n D 



T n (n,D):= sup ^n(g)\n. 



A* n m(g) 



m£M(C(X)) ^ M#) 

Here -D denote /z-proper partition of unity. 

Proposition 12.5 tells us that this definition is equivalent to definition (38), (39). 

Remark 12.7 To finish this section let us note that the first to be introduced was the def- 
inition of t-entropy leaning on /z-proper partitions, and namely this definition is presented 
in [4, 5, 6, 7]. 



13 Weighted shift operators and 
Variational Principle 

Example 7.6 shows the tight interrelation between transfer operators and weighted shift 
operators in L 1 (F, to). Developing the idea of this example we go further and present in 
this section the Variational Principle for the spectral radius of weighted shift operators 
acting in L p (Y,m) spaces. 

Model example. Just as in Example 7.6, let (Y, 21, to) be a measurable space with 
a o"-finite measure to, and f3 be a measurable mapping of Y into itself satisfying the 
condition 

m(p-\G)) < Cm(G), G e 21, (89) 

where the constant C does not depend on G. Let us consider the space L p (Y,m), where 
1 < p < oo. Set the shift operator T (generated by the mapping 0) by the formula 

[T/Kx) = /(/?(*)), feL p {Y,m). (90) 

Inequality (89) implies that the norm of this operator does not exceed C 1 ^. 

Note that in the case when p = oo this operator defines an endomorphism of the 
algebra L°°(Y, to), that is 

T(fg) = Tf-Tg, f, g E L°°(F, to). 

Just as in Example 7.6 we define the linear operator A : L°°(Y, m) — > L°°(Y, to) by 
means of the identity 

J f-go/3dm = J(Af)gdm, g e L\Y,m), f e L°°(Y,m) (91) 
(in other words, A is adjoint to the shift operator T on L 1 (y, to)). 
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The definition implies that A is a positive operator and it satisfies the homological 
identity 

A((Tf)g)=fAg, f,geL°°(X,m). (92) 

Therefore A is a transfer operator (for the C*-dynamical system (L°°(Y, m), T)). And in 
the case of a /^-invariant measure m it is a conditional expectation operator. 

Recall (see Section 6) that the C*-dynamical system (L°°(Y,m),T) can be canonically 
identified with the common dynamical system (X, a), where the compact space X is the 
maximal ideal space of the algebra L°°(Y,m), and the mapping a: X — > X is continuous. 
Under this identification the functions from L°°(Y, m) are identified with the elements of 
C(X), and the shift mapping / i-> /o/3 on L°°(Y, m) is identified with the shift mapping 
g \— > goa on C(X). Finally, as it was noted in Remark 7.4, the set M a (X) of all a- in- 
variant probability measures on X is identified with the set of all /^-invariant finitely 
additive probability measures on Y which are absolutely continuous with respect to m. 
We will denote the latter set by Mp(Y,m). 

Since the set Mp(Y,m) consists of finitely additive measures one can come across 
certain difficulties when defining the integrals by these measures for unbounded functions, 
and namely such integrals are needed in the next theorem. Fortunately, we can introduce 
them in a rather natural way by using the corresponding measures on C(X). Namely, 
let ip G L°°(Y,m) and ip G C(X) be its Gelfand transform, let also fx G M /3 (Y,m) and 
fx G M a (X) be the corresponding measure mentioned above. Then we set 

/ ln\^\dfi:= / \n\*p\dfi. (93) 

For any tp G L°°(Y, m) the operator ipT acting on L P (Y, m) and given by 

4>T:f^i>-Tf 

will be called a weighted shift operator (with the weight tp). Note, in particular, that 

Tip = T(iP)T, *P G L°°(Y, m). (94) 

Theorem 13.1 [Variational principle for weighted shift operators] For the spec- 
tral radius of the operator tpT : L p (Y,m) — > L p (Y,m), 1 < p < oo, the following varia- 
tional principle holds: 

]nrN>T)= max ( f In |^| d\i + ] , (95) 

neM (Y,m) \J Y p J 

where r(/i) is the t-entropy assigned to the transfer operator (91) and the integral is 
understood in the sense of (93) . 

Proof. In the case when p — 1 the operator A is adjoint to T. Therefore, the operator 
Aip is adjoint to ipT, and they have the same spectral radii. Thus, Theorem 11.2 implies 

In r(ipT) — In r(Aip) — max I / In l^l d/i + r(u) J . (96) 

fieM (Y,rn) \J Y J 
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Now let us consider the case p > 1. Note that for every function / e I^Y, m) one has 



/ \^T) n f\ p dm = / TT|^o/3 < | p |/o/3»| p dm= / (\^\ p T) n \f\ p dm. 
Jy Jy i=0 Jy 

So 

imT^m) = WmWm) = II WITL~ (y , m) , 07) 

and therefore 

plnr(^T) = lnr(|^| p T), 

where the operator ipT acts on the space L p (Y,m), and the operator |?/>| P T acts on he 
space m). Substituting in (96) the function \ip\ p in place of ip and dividing the result 
by p we obtain (95). □ 

Remark 13.2 If p — > oo then formula (95) transforms into the formula 

lnr('0T) = max ( / \n\ib\du I . 

VeMp(Y,m) \Jy J 

This restores the Variational Principle for the space L°°(Y,m) (cf. Introduction). 

Abstract weighted shifts. Axiomatization. The study of the weighted shift oper- 
ators naturally needs the usage of complex Banach algebras rather than the real ones. 
Note that the statement of Theorem 6.2 is valid not only for the base algebra but also 
for any semisimple commutative Banach algebra with an identity (see [40]) (we recall 
that a commutative Banach algebra is called semisimple if it has zero radical, that is 
the intersection of all maximal ideals is zero; in this case the Gelfand transform is an 
isomorphism). Consideration of the weighted shift operators in L P (Y, m) spaces presented 
above and, in particular, their interrelation with the naturally arising transfer operators 
(see (92) and (97)) and (94) along with the mentioned description of endomorphisms of 
semisimple commutative Banach algebras makes it natural the introduction of the follow- 
ing axiomatization of the weighted shift operators acting in L p type spaces. 

Definition 13.3 Let B C L(B) be a semisimple commutative subalgebra of the algebra 
L(B) of all linear continuous operators acting on a Banach space B and containing the 
identity operator 1; and let 8 be an endomorphism of B such that 6(1) = 1 (and hence 
by Theorem 6.2 having the form (35)). Let C be a certain functional base algebra on X 
(where X is the maximal ideal space of B) containing functions of the form Re ip for all 
(p E B, and such that (C, 5) is a C*-dynamical system. Let also A : C — > C be a certain 
transfer operator for (C, 5). 

We will say that an operator T e L(B) is an abstract shift operator (associated with 
5 and A) and ipT, ip e B, is an abstract weighted shift operator (in a space of L p type, 
1 < p < oo), if 

a) the equality Tip = 5(ip)T, p e B holds and 

b) it holds the identity 

IK^1L w HI(^MTi||^ v^s- (98) 
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Remark 13.4 1) Since C is a selfadjoint part of a C*-algebra it follows that if \(p\ G C 
then \{p\ p G C for every 1 < p < oo. 

2) The model example presented above is a special case of a general scheme and 
therewith the complexity of the spectral radius calculation in the general scheme is equal 
to that of a model example (cf. (98) and (97)). 

3) Recalling Remarks 7.4 4) and 7.7 3) we have to stress that in general given a 
concrete functional algebra and its endomorphism then a transfer operator is not neces- 
sarily associated with a point mapping of the domain and therefore abstract shift operator 
and abstract weighted shift operator do not have to originate from any mapping of the 
domain. 

4) Observe that for C = C(X) and any positive left inverse A to S (that is A is a 
conditional expectation operator) there exists a realization of the objects mentioned in 
Definition 13.3 (for p = 1). Indeed, let E = C(X)*, T = A* and for any if; G C{X) we 
define the operator ip: C(X)* — > C(X)* by the formula 



where [ipf](x) = ip(x)f(x) in the right-hand part. 

Routine check shows that for ip and T defined in this way all the conditions of Defi- 
nition 13.3 (for p = 1) are satisfied. 

5) If A is an invertible conditional expectation operator in C then \\Af\\ = \\f\\, f G C 
(recall the reasoning in the proof of Proposition 8.3) and therefore formula (98) transforms 
(for any p) into the formula 



Precisely according to this formula there was calculated the norm IK^T)^ in the process 
of deducing the variational principle (4). 

Remark 13.5 We would like to emphasize that the construction of an appropriate trans- 
fer operator A in the model example, namely, the operator by means of which one can 
calculate the norm of (ipT) n with the help of formula (97) shows that the operator re- 
quired should contain information as on the initial measure m so also on the interrelation 
between this measure and the mapping (3. Therefore the choice of A in Definition 13.3 
reflects in essence an abstract way of recording the corresponding information. 
To clarify this remark we present the following observation. 

Let T be the shift operator on L p (Y,m) defined in Model Example. And let £ be the 
partition of Y formed by the inverse images of f3, that is 



We denote by £(x) the element of £ containing x. Consider the canonical factor space 
(Y^, 21^, m^) corresponding to the partition £ and the set of canonical conditional measures 
m T (y), where r = for some x. The measures m T are probability measures (that is 
m T (r) = 1 for each r) and are defined by the equality 



ZeC{xy, feC(x), 




(99) 



t = {p-\v)}v&r- 
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(the details see, for example, in [42], 1.5.) 

Define the conditional expectation operator E in the space L°°(Y, m) by the formula 

[Ei/>](x)= f ^{y)dm p ~ 1{ - x \y). (100) 
Jp-Hx) 

It is clear that if < ip G L°°(Y, m) then E(ip) > and E satisfies the homological 
identity 

E(((poP)ip) =(pEip for all ip,tp G L°°(Y,m). (101) 

One may note also that El = 1, which implies (in view of (101)) that E is a conditional 
expectation operator (in the sense of Definition 6.5, where as the base algebra C we take 
the algebra of real- valued functions in L°°(Ym)). 
Let j3{m) be the measure defined by the equality 

(3(m)(G) := m((3~ 1 (G)), G G St, 

and let df3{m)/dm be the Radon-Nicodim derivative of j3{m) with respect to m. Inequality 
(89) implies that d(3(m)/dm G L QO (Y,m) and its norm does not exceed C. Clearly, the 
operator A in (91) satisfies the equality 

A = W^E, (102) 
dm 

which describes the subtle interrelation between A, m and (3. 

The Variational Principle for transfer operators obtained in the foregoing sections and 
the reasoning in the proof of Theorem 13.1 lead to the next variational principle for the 
abstract weighted shift operators. 

Theorem 13.6 [Variational Principle for the abstract weighted shift operators 
in L p type spaces] Let ipT be an operator satisfying equality (98) (in particular, ipT 
can be an abstract weighted shift operator described in Definition 13.3). For the spectral 
radius of ipT the following variational principle holds 

\nrN)T)= max (I ln|^|c^+^), (103) 

^M S (C) \J X p J 

where X is the maximal ideal space of B. 

Proof. Recalling (73) and (98) and applying Theorem 11.2 we conclude that 

\nr(ipT) = -£(H P ) = - max ( f In U\ p dp + tU) J 

p p tieM s {C) \J X J 

= max ( [ ln\4>\dfi + ^) . □ 

^M S {C) \Jx V J 
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Remark 13.7 1) If in Definition f3.3 A: C — > C is an invertible conditional expectation 
operator then in view of Proposition 8.3 the Variational Principle established coincides 
with the variational principle (4). 

2) If p — > oo then formula (103) transforms into formula (4) and this restores formally 
the variational principle (4) for the spaces of C(X) and L°°(X) type for an arbitrary 5 
and any transfer operator A for (C,5) (cf. Introduction). 

3) To continue the previous remark we observe that if instead of A mentioned in 
Definition 13.3 one takes a certain other transfer operator A : C — > C such that A = A°(p 
with some < (p G C (for instance in the Model Example considered above one can set 
A = E and ip = (df3(m) / dm) of3 (see (102) and (101))) then looking through the proof 
of the theorem we see that formula (103) transforms into 

lnr(^T)= max ( [ ( In \^\ + d^i + 

neM a (c) \J X V P J P J 

If p — > oo then this formula also transforms into formula (4). So weighted shift operators 
in C(X) and L°°(X) spaces 'do not care' about transfer operators. 

4) In the proof of Theorem 13.1 we have used only condition b) of Definition 13.3 and 
have not exploited condition a) at all. We have inserted condition a) into Definition 13.3 
simply to emphasize the relation between the shift operator T and endomorphism 5. 

5) Recalling Remarks 13.4 3) and 7.4 4) we observe that even in the standard space 
L 1 (y, m) an abstract weighted shift operator in general is not generated by any measur- 
able mapping [3 as in the Model Example considered above. So even in this situation 
Theorem 13.6 is a generalization of the corresponding result form [16]. 
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